Description

This is a continuation-in-part of a copending U.S. patent application having serial number 07/576,537, filed August
31, 1990 and entitled "Glyphosate Tolerant 5-Enolpyruvylshikimate-3-Phosphate Synthases."

This invention relates in general to plant molecular biology and, more particularly, to a new class of glyphosate
tolerant 5-enolpyruvylshikimate-3-phosphate synthases.

Recent advances in genetic engineering have provided the requisite tools to transform plants to contain foreign 
genes. It is now possible to produce plants which have unique characteristics of agronomic importance. Certainly, one 
such advantageous trait is more cost effective, environmentally compatible weed control via herbicide tolerance.
Herbicide-tolerant plants may reduce the need for tillage to control weeds, thereby effectively reducing soil erosion.

One herbicide which is the subject of much investigation in this regard is N-phosphonomethylglycine commonly 
referred to as glyphosate. Glyphosate inhibits the shikimic acid pathway which leads to the biosynthesis of aromatic 
compounds including amino acids, plant hormones and vitamins. Specifically, glyphosate curbs the conversion of
phosphoenolpyruvic acid (PEP) and 3-phosphoshikimic acid to 5-enolpyruvyl-3-phosphoshikimic acid by inhibiting the
enzyme 5-enolpyruvylshikimate-3-phosphate synthase (hereinafter referred to as EPSP synthase or EPSPS).

It has been shown that glyphosate tolerant plants can be produced by inserting into the genome of the plant the
capacity to produce a higher level of EPSP synthase in the chloroplast of the cell (Shah et al., 1986), which enzyme is
preferably glyphosate tolerant (Kishore et al., 1988). Variants of the wild-type EPSPS enzyme have been isolated which
are glyphosate tolerant as a result of alterations in the EPSPS amino acid coding sequence (Kishore and Shah, 1988;
Schulz et al., 1984; Sost et al., 1984; Kishore et al., 1986). These variants typically have a higher Ki for glyphosate
than the wild-type EPSPS enzyme, which confers the glyphosate tolerant phenotype, but these variants are also
characterized by a high Km for PEP, which makes the enzyme kinetically less efficient (Kishore and Shah, 1988; Sost et
al., 1984; Schulz et al., 1984; Kishore et al., 1986; Sost and Amrhein, 1990). For example, the apparent Km for PEP
and the apparent Ki for glyphosate for the native EPSPS from E. coli are 10 µM and 0.5 µM, while for a glyphosate
tolerant isolate having a single amino acid substitution of an alanine for the glycine at position 96 these values are 220
µM and 4.0 mM, respectively. A number of glyphosate tolerant plant variant EPSPS genes have been constructed by
mutagenesis. Again, the glyphosate tolerant EPSPS was impaired due to an increase in the Km for PEP and a slight
reduction of the Vmax of the native plant enzyme (Kishore and Shah, 1988), thereby lowering the catalytic efficiency
(Vmax/Km) of the enzyme. Since the kinetic constants of the variant enzymes are impaired with respect to PEP, it has
been proposed that high levels of overproduction of the variant enzyme, 40-80 fold, would be required to maintain
normal catalytic activity in plants in the presence of glyphosate (Kishore et al., 1988).
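
The trade-off described above can be illustrated numerically. The following is a minimal sketch, not part of the original disclosure, which assumes that glyphosate behaves as a simple competitive inhibitor with respect to PEP, so that the rate relative to Vmax is S / (Km(1 + I/Ki) + S). The kinetic constants are the E. coli values quoted in the text; the PEP and glyphosate concentrations are hypothetical and chosen only for illustration.

```python
def fractional_rate(pep_um, glyphosate_um, km_um, ki_um):
    """Return v/Vmax assuming competitive inhibition with respect to PEP."""
    apparent_km = km_um * (1.0 + glyphosate_um / ki_um)
    return pep_um / (apparent_km + pep_um)


# Constants quoted in the text for the E. coli EPSPS, in µM.
NATIVE = {"km_um": 10.0, "ki_um": 0.5}
GLY96ALA = {"km_um": 220.0, "ki_um": 4000.0}  # 4.0 mM expressed in µM

# Hypothetical conditions (assumptions, not taken from the text).
PEP_UM = 100.0
GLYPHOSATE_UM = 1000.0

for name, enzyme in (("native", NATIVE), ("Gly96Ala", GLY96ALA)):
    uninhibited = fractional_rate(PEP_UM, 0.0, **enzyme)
    inhibited = fractional_rate(PEP_UM, GLYPHOSATE_UM, **enzyme)
    print(f"{name}: v/Vmax {uninhibited:.3f} without, {inhibited:.3f} with glyphosate")
```

Under these assumed conditions the variant retains most of its activity when glyphosate is present but starts from a lower uninhibited rate because of its higher Km for PEP, which is the impairment the text describes.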

While such variant EPSP synthases have proved useful in obtaining transgenic plants tolerant to glyphosate, it 
would be increasingly beneficial to obtain an EPSP synthase that is highly glyphosate tolerant while still kinetically 
efficient, such that the amount of the glyphosate tolerant EPSPS needed to be produced to maintain normal catalytic
activity in the plant is reduced or that improved tolerance be obtained with the same expression level.

Previous studies have shown that EPSPS enzymes from different sources vary widely with respect to their degree 
of sensitivity to inhibition by glyphosate. A study of plant and bacterial EPSPS enzyme activity as a function of
glyphosate concentration showed that there was a very wide range in the degree of sensitivity to glyphosate. The degree of
sensitivity showed no correlation with any genus or species tested (Schulz et al., 1985). Insensitivity to glyphosate
inhibition of the activity of the EPSPS from the Pseudomonas sp. PG2982 has also been reported but with no details
of the studies (Fitzgibbon, 1988). In general, while such natural tolerance has been reported, there is no report
suggesting the kinetic superiority of the naturally occurring bacterial glyphosate tolerant EPSPS enzymes over those of
mutated EPSPS enzymes, nor have any of the genes been characterized. Similarly, there are no reports on the
expression of naturally glyphosate tolerant EPSPS enzymes in plants to confer glyphosate tolerance.


SUMMARY OF THE INVENTION 

A DNA molecule comprising DNA encoding a kinetically efficient, glyphosate tolerant EPSP synthase is presented.
The EPSP synthases of the present invention reduce the amount of overproduction of the EPSPS enzyme in a
transgenic plant necessary for the enzyme to maintain catalytic activity while still conferring glyphosate tolerance. This and
other EPSP synthases described herein represent a new class of EPSPS enzymes, referred to hereinafter as Class
II EPSPS enzymes. Class II EPSPS enzymes share little homology to known bacterial or plant EPSPS enzymes and
exhibit tolerance to glyphosate while maintaining suitable Km (PEP) ranges. Suitable ranges of Km (PEP) for EPSPS
enzymes of the present invention are between 1-150 µM, with a more preferred range of between 1-35 µM, and a
most preferred range between 2-25 µM. These kinetic constants are determined under the assay conditions specified
hereinafter. The Vmax of the enzyme should preferably be at least 15% of the uninhibited plant enzyme and more
preferably greater than 25%. An EPSPS of the present invention preferably has a Ki for glyphosate range of between
25-10000 µM. The Ki/Km ratio should be between 3-500, and more preferably between 6-250. The Vmax should
preferably be in the range of 2-100 units/mg (µmoles/minute·mg at 25°C) and the Km for shikimate-3-phosphate should
preferably be in the range of 0.1 to 50 µM.
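
The preferred ranges stated above lend themselves to a simple screening check. The sketch below is illustrative only: the class name `Kinetics`, the function `meets_broad_ranges` and both example enzymes are hypothetical, and only the broadest Km (PEP), Ki (glyphosate) and Ki/Km ranges from the text are encoded.

```python
from dataclasses import dataclass

# Broadest preferred ranges quoted in the text (µM for Km and Ki; the ratio is unitless).
KM_PEP_RANGE_UM = (1.0, 150.0)
KI_GLYPHOSATE_RANGE_UM = (25.0, 10000.0)
KI_KM_RATIO_RANGE = (3.0, 500.0)


@dataclass(frozen=True)
class Kinetics:
    km_pep_um: float
    ki_glyphosate_um: float

    @property
    def ki_km_ratio(self) -> float:
        return self.ki_glyphosate_um / self.km_pep_um


def within(value, bounds):
    """True when value lies inside the inclusive (low, high) bounds."""
    low, high = bounds
    return low <= value <= high


def meets_broad_ranges(k: Kinetics) -> bool:
    return (
        within(k.km_pep_um, KM_PEP_RANGE_UM)
        and within(k.ki_glyphosate_um, KI_GLYPHOSATE_RANGE_UM)
        and within(k.ki_km_ratio, KI_KM_RATIO_RANGE)
    )


# Hypothetical enzymes used only to exercise the check.
candidate = Kinetics(km_pep_um=20.0, ki_glyphosate_um=1000.0)
sensitive = Kinetics(km_pep_um=5.0, ki_glyphosate_um=0.4)
print(meets_broad_ranges(candidate), meets_broad_ranges(sensitive))
```

The Vmax and shikimate-3-phosphate criteria could be added in the same way; they are omitted here to keep the sketch short.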

Genes coding for Class II EPSPS enzymes have been isolated from three (3) different bacteria: Agrobacterium 
tumefaciens sp. strain CP4, Achromobacter sp. strain LBAA, and Pseudomonas sp. strain PG2982. The LBAA and 
PG2982 Class II EPSPS genes have been determined to be identical and the proteins encoded by these two genes
are very similar to the CP4 protein and share approximately 84% amino acid identity with it. Class II EPSPS enzymes 
can be readily distinguished from Class I EPSPS's by their inability to react with polyclonal antibodies prepared from 
Class I EPSPS enzymes under conditions where other Class I EPSPS enzymes would readily react with the Class I 
antibodies. 

Other Class II EPSPS enzymes can be readily isolated and identified by utilizing a nucleic acid probe from one of
the Class II EPSPS genes disclosed herein using standard hybridization techniques. Such a probe from the CP4 strain
has been prepared and utilized to isolate the Class II EPSPS genes from strains LBAA and PG2982. These genes 
may also be adapted for enhanced expression in plants by known methodology. Such a probe has also been used to 
identify homologous genes in bacteria isolated de novo from soil. 

The Class II EPSPS enzymes are preferably fused to a chloroplast transit peptide (CTP) to target the protein to
the chloroplasts of the plant into which it may be introduced. Chimeric genes encoding this CTP-Class II EPSPS fusion
protein may be prepared with an appropriate promoter and 3' polyadenylation site for introduction into a desired plant
by standard methods. 

Therefore, in one aspect, the present invention provides a new class of EPSP synthases that exhibit a low Km for
phosphoenolpyruvate (PEP), a high Vmax/Km ratio, and a high Ki for glyphosate such that when introduced into a plant,
the plant is made glyphosate tolerant such that the catalytic activity of the enzyme and plant metabolism are maintained 
in a substantially normal state. For purposes of this discussion, a highly efficient EPSPS refers to its efficiency in the 
presence of glyphosate. 

In another aspect of the present invention, a double-stranded DNA molecule comprising DNA encoding a Class
II EPSPS enzyme is disclosed. A Class II EPSPS enzyme DNA sequence is disclosed from three sources:
Agrobacterium sp. strain designated CP4, Achromobacter sp. strain LBAA and Pseudomonas sp. strain PG2982.

In a further aspect of the present invention, a nucleic acid probe from an EPSPS Class II gene is presented that
is suitable for use in screening for Class II EPSPS genes in other sources by assaying for the ability of a DNA sequence
from the other source to hybridize to the probe.

In yet another aspect of the present invention, transgenic plants and transformed plant cells are disclosed that are
made glyphosate tolerant by the introduction of a Class II EPSPS gene into the plant's genome.

In a still further aspect of the invention, a recombinant, double-stranded DNA molecule is provided comprising in sequence:

a) a promoter which functions in plant cells to cause the production of an RNA sequence;

b) a structural DNA sequence that causes the production of an RNA sequence which encodes a Class II EPSPS
enzyme; and

c) a 3' nontranslated region which functions in plant cells to cause the addition of a stretch of polyadenyl nucleotides
to the 3' end of the RNA sequence

where the promoter is heterologous with respect to the structural DNA sequence and adapted to cause sufficient
expression of the fusion polypeptide to enhance the glyphosate tolerance of a plant cell transformed with said DNA
molecule.

In still another aspect of the present invention, a method for selectively controlling weeds in a crop field is presented 
by planting crop seeds or crop plants transformed with a Class II EPSPS gene to confer glyphosate tolerance to the 
plants which allows for glyphosate containing herbicides to be applied to the crop to selectively kill the glyphosate
sensitive weeds, but not the crops. 

Other and further objects, advantages and aspects of the invention will become apparent from the accompanying 
drawing figures and the description of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows the DNA sequence (SEQ ID NO:1) for the full-length promoter of figwort mosaic virus (FMV35S).
Figure 2 shows the cosmid cloning vector pMON17020. 

Figure 3 shows the structural DNA sequence (SEQ ID NO:2) for the Class II EPSPS gene from bacterial isolate 
Agrobacterium sp. strain CP4 and the deduced amino acid sequence (SEQ ID NO:3).

Figure 4 shows the structural DNA sequence (SEQ ID NO:4) for the Class II EPSPS gene from the bacterial isolate 
Achromobacter sp. strain LBAA and the deduced amino acid sequence (SEQ ID NO: 5). 

Figure 5 shows the structural DNA sequence (SEQ ID NO:6) for the Class II EPSPS gene from the bacterial isolate 
Pseudomonas sp. strain PG2982 and the deduced amino acid sequence (SEQ ID NO:7).

Figure 6 shows the Bestfit comparison of the E. coli EPSPS amino acid sequence (SEQ ID NO:8) with that for the
CP4 EPSPS (SEQ ID NO:3).

Figure 7 shows the Bestfit comparison of the CP4 EPSPS amino acid sequence (SEQ ID NO:3) with that for the 
LBAA EPSPS (SEQ ID NO:5).

Figure 8 shows the structural DNA sequence (SEQ ID NO:9) for the synthetic CP4 Class II EPSPS gene.

Figure 9 shows the DNA sequence (SEQ ID NO:10) of the chloroplast transit peptide (CTP) and encoded amino
acid sequence (SEQ ID NO:11) derived from the Arabidopsis thaliana EPSPS CTP and containing a SphI restriction
site at the chloroplast processing site, hereinafter referred to as CTP2.

Figure 10 shows the DNA sequence (SEQ ID NO:12) of the chloroplast transit peptide and encoded amino acid
sequence (SEQ ID NO:13) derived from the Arabidopsis thaliana EPSPS gene and containing an EcoRI restriction
site within the mature region of the EPSPS, hereinafter referred to as CTP3.

Figure 11 shows the DNA sequence (SEQ ID NO:14) of the chloroplast transit peptide and encoded amino acid
sequence (SEQ ID NO:15) derived from the Petunia hybrida EPSPS CTP and containing a SphI restriction site at the
chloroplast processing site and in which the amino acids at the processing site are changed to -Cys-Met-, hereinafter
referred to as CTP4.

Figure 12 shows the DNA sequence (SEQ ID NO:16) of the chloroplast transit peptide and encoded amino acid
sequence (SEQ ID NO:17) derived from the Petunia hybrida EPSPS gene with the naturally occurring EcoRI site in
the mature region of the EPSPS gene, hereinafter referred to as CTP5.

Figure 13 shows a plasmid map of CP4 plant transformation/expression vector pMON17110.

Figure 14 shows a plasmid map of CP4 synthetic EPSPS gene plant transformation/expression vector 
pMON17131. 

Figure 15 shows a plasmid map of CP4 EPSPS free DNA plant transformation expression vector pMON13640. 

Figure 16 shows a plasmid map of CP4 plant transformation/direct selection vector pMON17227.

Figure 17 shows a plasmid map of CP4 plant transformation/expression vector pMON19653.

The expression of a plant gene which exists in double-stranded DNA form involves synthesis of messenger RNA 
(mRNA) from one strand of the DNA by RNA polymerase enzyme, and the subsequent processing of the mRNA primary 
transcript inside the nucleus. This processing involves a 3' non-translated region which adds polyadenylate nucleotides
to the 3' end of the RNA.

Transcription of DNA into mRNA is regulated by a region of DNA usually referred to as the "promoter." The promoter
region contains a sequence of bases that signals RNA polymerase to associate with the DNA, and to initiate the
transcription into mRNA using one of the DNA strands as a template to make a corresponding complementary strand of
RNA.

A number of promoters which are active in plant cells have been described in the literature. These include the
nopaline synthase (NOS) and octopine synthase (OCS) promoters (which are carried on tumor-inducing plasmids of
Agrobacterium tumefaciens), the cauliflower mosaic virus (CaMV) 19S and 35S promoters, the light-inducible promoter
from the small subunit of ribulose bis-phosphate carboxylase (ssRUBISCO, a very abundant plant polypeptide) and
the full-length transcript promoter from the figwort mosaic virus (FMV35S). All of these promoters have been used to
create various types of DNA constructs which have been expressed in plants; see, e.g., PCT publication WO 84/02913
(Rogers et al., Monsanto).

Promoters which are known or are found to cause transcription of DNA in plant cells can be used in the present
invention. Such promoters may be obtained from a variety of sources such as plants and plant DNA viruses and include,
but are not limited to, the CaMV35S and FMV35S promoters and promoters isolated from plant genes such as
ssRUBISCO genes. As described below, it is preferred that the particular promoter selected should be capable of causing
sufficient expression to result in the production of an effective amount of a Class II EPSPS to render the plant
substantially tolerant to glyphosate herbicides. The amount of Class II EPSPS needed to induce the desired tolerance
may vary with the plant species. It is preferred that the promoters utilized have relatively high expression in all
meristematic tissues in addition to other tissues inasmuch as it is now known that glyphosate is translocated and accumulated
in this type of plant tissue. Alternatively, a combination of chimeric genes can be used to cumulatively result in the
necessary overall expression level of the selected Class II EPSPS enzyme to result in the glyphosate tolerant
phenotype.

The mRNA produced by a DNA construct of the present invention also contains a 5' non-translated leader sequence.
This sequence can be derived from the promoter selected to express the gene, and can be specifically modified so as
to increase translation of the mRNA. The 5' non-translated regions can also be obtained from viral RNAs, from suitable
eukaryotic genes, or from a synthetic gene sequence. The present invention is not limited to constructs, as presented
in the following examples, wherein the non-translated region is derived from both the 5' non-translated sequence that
accompanies the promoter sequence and part of the 5' non-translated region of the virus coat protein gene. Rather,
the non-translated leader sequence can be derived from an unrelated promoter or coding sequence as discussed
above.

A preferred promoter for use in the present invention is the full-length transcript (SEQ ID NO:1) promoter from the
figwort mosaic virus (FMV35S) which functions as a strong and uniform promoter with particularly good expression in
meristematic tissue for chimeric genes inserted into plants, particularly dicotyledons. The resulting transgenic plant in
general expresses the protein encoded by the inserted gene at a higher and more uniform level throughout the tissues
and cells of the transformed plant than the same gene driven by an enhanced CaMV35S promoter. Referring to Figure
1, the DNA sequence (SEQ ID NO:1) of the FMV35S promoter is located between nucleotides 6368 and 6930 of the 
FMV genome. A 5' non-translated leader sequence is preferably coupled with the promoter. The leader sequence can 
be from the FMV35S genome itself or can be from a source other than FMV35S. 

The 3' non-translated region of the chimeric plant gene contains a polyadenylation signal which functions in plants
to cause the addition of polyadenylate nucleotides to the 3' end of the viral RNA. Examples of suitable 3' regions are
(1) the 3' transcribed, non-translated regions containing the polyadenylated signal of Agrobacterium tumor-inducing
(Ti) plasmid genes, such as the nopaline synthase (NOS) gene, and (2) plant genes like the soybean storage protein
genes and the small subunit of the ribulose-1,5-bisphosphate carboxylase (ssRUBISCO) gene. An example of a
preferred 3' region is that from the ssRUBISCO gene from pea (E9), described in greater detail below.

The DNA constructs of the present invention also contain a structural coding sequence in double-stranded DNA 
form which encodes a glyphosate tolerant, highly efficient Class II EPSPS enzyme. 

Identification of glyphosate tolerant, highly efficient EPSPS enzymes 


In an attempt to identify and isolate glyphosate tolerant, highly efficient EPSPS enzymes, kinetic analysis of the
EPSPS enzymes from a number of bacteria exhibiting tolerance to glyphosate or that had been isolated from suitable
sources was undertaken. It was discovered that in some cases the EPSPS enzymes showed no tolerance to inhibition
by glyphosate and it was concluded that the tolerance phenotype of the bacterium was due to an impermeability to
glyphosate or other factors. In a number of cases, however, microorganisms were identified whose EPSPS enzyme
showed a greater degree of tolerance to inhibition by glyphosate and that displayed a low Km for PEP when compared
to that previously reported for other microbial and plant sources. The EPSPS enzymes from these microorganisms
were then subjected to further study and analysis.

Table I displays the data obtained for the EPSPS enzymes identified and isolated as a result of the above described
analysis. Table I includes data for three identified Class II EPSPS enzymes that were observed to have a high tolerance
to inhibition by glyphosate and a low Km for PEP as well as data for the native Petunia EPSPS and a glyphosate tolerant
variant of the Petunia EPSPS referred to as GA101. The GA101 variant is so named because it exhibits the substitution
of an alanine residue for a glycine residue at position 101 (with respect to Petunia) in the invariant region. When the
change introduced into the Petunia EPSPS (GA101) was introduced into a number of other EPSPS enzymes, similar
changes in kinetics were observed: an elevation of the Ki for glyphosate and of the Km for PEP.



Table I

Kinetic characterization of EPSPS enzymes

ENZYME SOURCE      Km PEP (µM)     Ki Glyphosate (µM)     Ki/Km
Petunia            5               0.4                    0.08
Petunia GA101      200             2000                   10
PG2982             2.1-3.1 (1)     25-82                  ~8-40
LBAA               ~7.3-8 (2)      60 (est)               ~7.9
CP4                12 (3)          2720                   227

(1) Range of PEP tested = 1-40 µM
(2) Range of PEP tested = 5-80 µM
(3) Range of PEP tested = 1.5-40 µM
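
The Ki/Km column of Table I can be recomputed from the other two columns as a consistency check. The sketch below uses the Petunia, Petunia GA101 and CP4 rows; it assumes that the CP4 Km entry is 12 µM followed by footnote marker 3, a reading that the tabulated ratio of 227 supports. The dictionary and function names are hypothetical.

```python
# source: (Km for PEP in µM, Ki for glyphosate in µM, Ki/Km as tabulated)
TABLE_I = {
    "Petunia": (5.0, 0.4, 0.08),
    "Petunia GA101": (200.0, 2000.0, 10.0),
    "CP4": (12.0, 2720.0, 227.0),  # Km read as 12 µM plus footnote marker (assumption)
}


def ki_km_ratio(km_um, ki_um):
    """Ratio of the glyphosate inhibition constant to the Km for PEP."""
    return ki_um / km_um


for source, (km, ki, tabulated) in TABLE_I.items():
    computed = ki_km_ratio(km, ki)
    print(f"{source:14s} computed {computed:8.2f}   tabulated {tabulated:8.2f}")
```

A larger ratio indicates an enzyme that tolerates more glyphosate for a given affinity for PEP, which is why the CP4 enzyme stands out in the table.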



The Agrobacterium sp. strain CP4 was initially identified by its ability to grow on glyphosate as a carbon source
(10 mM) in the presence of 1 mM phosphate. The strain CP4 was identified from a collection obtained from a
fixed-bed immobilized cell column that employed Mannville R-635 diatomaceous earth beads. The column had been run for
three months on a waste-water feed from a glyphosate production plant. The column contained 50 mg/ml glyphosate
and NH3 as NH4Cl. Total organic carbon was 300 mg/ml and BOD's (Biological Oxygen Demand - a measure of "soft"
carbon availability) were less than 30 mg/ml. This treatment column has been described (Heitkamp et al., 1990).
Dworkin-Foster minimal salts medium containing glyphosate at 10 mM and with phosphate at 1 mM was used to select for
microbes from a wash of this column that were capable of growing on glyphosate as sole carbon source.
Dworkin-Foster minimal medium was made up by combining in 1 liter (with autoclaved H2O), 1 ml each of A, B and C and 10
ml of D (as per below) and thiamine HCl (5 mg).



A. D-F Salts (1000X stock; per 100 ml; autoclaved):

     H3BO3           1 mg
     MnSO4.7H2O      1 mg
     ZnSO4.7H2O      12.5 mg
     CuSO4.5H2O      8 mg
     NaMoO3.3H2O     1.7 mg

B. FeSO4.7H2O (1000X stock; per 100 ml; autoclaved)     0.1 g

C. MgSO4.7H2O (1000X stock; per 100 ml; autoclaved)     20 g

D. (NH4)2SO4 (100X stock; per 100 ml; autoclaved)       20 g
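
The final concentration of each component in the complete medium follows from the stock recipes and the volumes combined. The sketch below is a worked calculation only; it assumes the medium is made up to exactly 1 liter, and the names `STOCKS` and `final_mg_per_liter` are hypothetical.

```python
STOCK_VOLUME_ML = 100.0  # each stock is specified per 100 ml

# component: (mg dissolved per 100 ml of stock, ml of that stock added per liter of medium)
STOCKS = {
    "H3BO3": (1.0, 1.0),
    "MnSO4.7H2O": (1.0, 1.0),
    "ZnSO4.7H2O": (12.5, 1.0),
    "CuSO4.5H2O": (8.0, 1.0),
    "NaMoO3.3H2O": (1.7, 1.0),
    "FeSO4.7H2O": (100.0, 1.0),       # 0.1 g
    "MgSO4.7H2O": (20000.0, 1.0),     # 20 g
    "(NH4)2SO4": (20000.0, 10.0),     # 20 g, 100X stock so 10 ml is added
}


def final_mg_per_liter(mass_mg_per_stock, ml_added):
    """Mass delivered into 1 liter of medium by the given volume of stock."""
    return mass_mg_per_stock / STOCK_VOLUME_ML * ml_added


for component, (mass_mg, ml_added) in STOCKS.items():
    print(f"{component:12s} {final_mg_per_liter(mass_mg, ml_added):10.3f} mg/l")
```

The thiamine HCl (5 mg per liter) is added directly rather than from a stock and is therefore not part of the calculation.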



Yeast Extract (YE; Difco) was added to a final concentration of 0.01 or 0.001%. The strain CP4 was also grown
on media composed of D-F salts, amended as described above, containing glucose, gluconate and citrate (each at
0.1%) as carbon sources and with inorganic phosphate (0.2 - 1.0 mM) as the phosphorus source.

Other Class II EPSPS containing microorganisms were identified as Achromobacter sp. strain LBAA, which was
from a collection of bacteria previously described (Hallas et al., 1988), and Pseudomonas sp. strain PG2982 which
has been described in the literature (Moore et al., 1983; Fitzgibbon, 1988). It had been reported previously, from
measurements in crude lysates, that the EPSPS enzyme from strain PG2982 was less sensitive to inhibition by glyphosate
than that of E. coli, but there has been no report of the details of this lack of sensitivity and there has been no report
on the Km for PEP for this enzyme or of the DNA sequence for the gene for this enzyme (Fitzgibbon, 1988; Fitzgibbon
and Braymer, 1990).



Relationship of the Class II EPSPS to those previously studied 


All EPSPS proteins studied to date have shown a remarkable degree of homology. For example, bacterial and
plant EPSPS's are about 54% identical, with similarity as high as 80%. Within bacterial EPSPS's and plant EPSPS's
themselves the degree of identity and similarity is much greater (see Table II).

Table II

Comparison between exemplary Class I EPSPS protein sequences (1)

                                similarity     identity
E. coli vs. S. typhimurium      93.0           88.3
P. hybrida vs. E. coli          71.9           54.5
P. hybrida vs. Tomato           92.8           88.2

(1) The EPSPS sequences compared here were obtained from the following references: E. coli, Rogers et al., 1983; S. typhimurium, Stalker et al.,
1985; Petunia hybrida, Shah et al., 1986; and Tomato, Gasser et al., 1988.
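
Percent identity and similarity figures of the kind shown in Table II are derived from a pairwise alignment. The following sketch shows one way such figures can be computed from two already aligned sequences. It is not the Bestfit procedure: the residue groups used to define similarity, the choice of counting only positions without gaps, and the two short example sequences are all assumptions made for illustration.

```python
# Toy groups of chemically similar residues (hypothetical stand-in for a scoring table).
SIMILAR_GROUPS = ("ILVM", "FYW", "KRH", "DE", "ST", "NQ", "AG")


def residues_similar(a, b):
    """True for identical residues or residues sharing one of the toy groups."""
    return a == b or any(a in group and b in group for group in SIMILAR_GROUPS)


def identity_and_similarity(aligned_a, aligned_b, gap="-"):
    """Return (percent identity, percent similarity) over gap-free aligned positions."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != gap and b != gap]
    if not pairs:
        raise ValueError("no gap-free positions to compare")
    identical = sum(a == b for a, b in pairs)
    similar = sum(residues_similar(a, b) for a, b in pairs)
    return 100.0 * identical / len(pairs), 100.0 * similar / len(pairs)


# Made-up aligned fragments, not taken from any EPSPS sequence.
identity, similarity = identity_and_similarity("MKTLLVA-G", "MRTLIVASG")
print(f"identity {identity:.1f}%  similarity {similarity:.1f}%")
```

Because similarity counts conservative substitutions as well as exact matches, it is always at least as high as identity, as in each row of Table II.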



When crude extracts of CP4 and LBAA bacteria (50 µg protein) were probed using rabbit anti-EPSPS antibody
(Padgette et al., 1987) to the Petunia EPSPS protein in a Western analysis, no positive signal could be detected, even
with extended exposure times (Protein A - 125I development system) and under conditions where the control EPSPS
(Petunia EPSPS, 20 ng; a Class I EPSPS) was readily detected. The presence of EPSPS activity in these extracts
was confirmed by enzyme assay. This surprising result, indicating a lack of similarity between the EPSPS's from these
bacterial isolates and those previously studied, coupled with the combination of a low Km for PEP and a high Ki for
glyphosate, illustrates that these new EPSPS enzymes are different from known EPSPS enzymes (now referred to as
Class I EPSPS).

Glyphosate Tolerant Enzymes in Microbial Isolates 

For clarity and brevity of disclosure, the following description of the isolation of genes encoding Class II EPSPS
enzymes is directed to the isolation of such a gene from a bacterial isolate. Those skilled in the art will recognize that
the same or similar strategy can be utilized to isolate such genes from other microbial isolates, plant or fungal sources. 

Cloning of the Agrobacterium sp. strain CP4 EPSPS Gene(s) in E. coli


Having established the existence of a suitable EPSPS in Agrobacterium sp. strain CP4, two parallel approaches
were undertaken to clone the gene: cloning based on the expected phenotype for a glyphosate tolerant EPSPS; and
purification of the enzyme to provide material to raise antibodies and to obtain amino acid sequences from the protein
to facilitate the verification of clones. Cloning and genetic techniques, unless otherwise indicated, are generally those
described in Maniatis et al., 1982 or Sambrook et al., 1987. The cloning strategy was as follows: introduction of a
cosmid bank of Agrobacterium sp. strain CP4 into E. coli and selection for the EPSPS gene by selection for
growth on inhibitory concentrations of glyphosate.

Chromosomal DNA was prepared from Agrobacterium sp. strain CP4 as follows: The cell pellet from a 200
ml L-Broth (Miller, 1972), late log phase culture of Agrobacterium sp. strain CP4 was resuspended in 10 ml of Solution
I: 50 mM Glucose, 10 mM EDTA, 25 mM Tris-Cl pH 8.0 (Birnboim and Doly, 1979). SDS was added to a final
concentration of 1% and the suspension was subjected to three freeze-thaw cycles, each consisting of immersion in dry
ice for 15 minutes and in water at 70°C for 10 minutes. The lysate was then extracted four times with equal volumes
of phenol:chloroform (1:1; phenol saturated with TE; TE = 10 mM Tris pH 8.0; 1.0 mM EDTA) and the phases separated
by centrifugation (15000g; 10 minutes). The ethanol-precipitable material was pelleted from the supernatant by brief
centrifugation (8000g; 5 minutes) following addition of two volumes of ethanol. The pellet was resuspended in 5 ml TE
and dialyzed for 16 hours at 4°C against 2 liters TE. This preparation yielded a 5 ml DNA solution of 552 µg/ml.

Partially-restricted DNA was prepared as follows. Three 100 µg aliquot samples of CP4 DNA were treated for 1
hour at 37°C with restriction endonuclease HindIII at rates of 4, 2 and 1 enzyme unit/µg DNA, respectively. The DNA
samples were pooled, made 0.25 mM with EDTA and extracted with an equal volume of phenol:chloroform. Following
the addition of sodium acetate and ethanol, the DNA was precipitated with two volumes of ethanol and pelleted by
centrifugation (12000 g; 10 minutes). The dried DNA pellet was resuspended in 500 µl TE and layered on a 10-40%
sucrose gradient (in 5% increments of 5.5 ml each) in 0.5 M NaCl, 50 mM Tris pH 8.0, 5 mM EDTA. Following
centrifugation for 20 hours at 26,000 rpm in a SW28 rotor, the tubes were punctured and ~1.5 ml fractions collected. Samples
(20 µl) of each second fraction were run on 0.7% agarose gel and the size of the DNA determined by comparison with
linearized lambda DNA and HindIII-digested lambda DNA standards. Fractions containing DNA of 25-35 kb fragments
were pooled, desalted on AMICON10 columns (7000 rpm; 20°C; 45 minutes) and concentrated by precipitation. This
procedure yielded 15 µg of CP4 DNA of the required size. A cosmid bank was constructed using the vector pMON17020.
This vector, a map of which is presented in Figure 2, is based on the pBR327 replicon and contains the
spectinomycin/streptomycin (Spr; spc) resistance gene from Tn7 (Fling et al., 1985), the chloramphenicol resistance gene (Cmr;
cat) from Tn9 (Alton et al., 1979), the gene 10 promoter region from phage T7 (Dunn et al., 1983), and the 1.6 kb BglII
phage lambda cos fragment from pHC79 (Hohn and Collins, 1980). A number of cloning sites are located downstream
of the cat gene. Since the predominant block to the expression of genes from other microbial sources in E. coli appears
to be at the level of transcription, the use of the T7 promoter and supplying the T7 polymerase in trans from the pGP1-2
plasmid (Tabor and Richardson, 1985) enables the expression of large DNA segments of foreign DNA, even those
containing RNA polymerase transcription termination sequences. The expression of the spc gene is impaired by
transcription from the T7 promoter such that only Cmr can be selected in strains containing pGP1-2. The use of antibiotic
resistances such as Cm resistance which do not employ a membrane component is preferred due to the observation
that high level expression of resistance genes that involve a membrane component, i.e. β-lactamase and Amp
resistance, give rise to a glyphosate tolerant phenotype. Presumably, this is due to the exclusion of glyphosate from the cell
by the membrane localized resistance protein. It is also preferred that the selectable marker be oriented in the same
direction as the T7 promoter.
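
The quantities given earlier in this passage for the partial digestion and the sucrose step gradient can be tabulated with a short calculation. This is a sketch for orientation only: the function names are hypothetical, and the recovery figure assumes the yield of size-selected DNA reads as 15 µg from the 300 µg digested.

```python
ALIQUOT_UG = 100.0               # DNA per digestion, from the text
UNITS_PER_UG = (4.0, 2.0, 1.0)   # HindIII units per µg DNA in the three digestions
STEP_VOLUME_ML = 5.5             # volume of each sucrose step


def enzyme_units(dna_ug, units_per_ug):
    """Total enzyme units needed to digest dna_ug at the given rate."""
    return dna_ug * units_per_ug


def gradient_steps(start_pct=10, stop_pct=40, increment_pct=5):
    """Sucrose percentages of a step gradient, both end points included."""
    return list(range(start_pct, stop_pct + increment_pct, increment_pct))


units = [enzyme_units(ALIQUOT_UG, rate) for rate in UNITS_PER_UG]
steps = gradient_steps()
total_gradient_ml = len(steps) * STEP_VOLUME_ML
recovery_pct = 100.0 * 15.0 / (len(UNITS_PER_UG) * ALIQUOT_UG)  # 15 µg read from the text

print("HindIII units per digestion:", units)
print("gradient steps (% sucrose):", steps)
print(f"gradient volume: {total_gradient_ml} ml; recovery of 25-35 kb DNA: {recovery_pct}%")
```

Using three enzyme-to-DNA ratios and pooling the products is a common way to make sure that at least part of the DNA is cut to the desired size range.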

The vector was then cut with HindIII and treated with calf alkaline phosphatase (CAP) in preparation for cloning.
Vector and target sequences were ligated by combining the following:



     Vector DNA (HindIII/CAP)                           3 µg
     Size fractionated CP4 HindIII fragments            1.5 µg
     10X ligation buffer                                2.2 µl
     T4 DNA ligase (New England Biolabs) (400 U/µl)     1.0 µl



and adding H2O to 22.0 µl. This mixture was incubated for 18 hours at 16°C. 10X ligation buffer is 250 mM Tris-HCl,
pH 8.0; 100 mM MgCl2; 100 mM Dithiothreitol; 2 mM Spermidine. The ligated DNA (5 µl) was packaged into lambda
phage particles (Stratagene; Gigapack Gold) using the manufacturer's procedure.
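
The working concentrations in the ligation reaction follow directly from the volumes listed. The sketch below simply carries out that dilution arithmetic; the function and variable names are hypothetical, and the fraction packaged assumes the 5 µl was taken from the 22.0 µl reaction.

```python
TOTAL_VOLUME_UL = 22.0
BUFFER_VOLUME_UL = 2.2
LIGASE_UNITS_PER_UL = 400.0
LIGASE_VOLUME_UL = 1.0

# Composition of the 10X ligation buffer given in the text (mM).
BUFFER_10X_MM = {
    "Tris-HCl pH 8.0": 250.0,
    "MgCl2": 100.0,
    "dithiothreitol": 100.0,
    "spermidine": 2.0,
}


def final_concentrations(stock_mm, stock_volume_ul, total_volume_ul):
    """Dilute every stock component by stock_volume / total_volume."""
    dilution = stock_volume_ul / total_volume_ul
    return {name: conc * dilution for name, conc in stock_mm.items()}


final_mm = final_concentrations(BUFFER_10X_MM, BUFFER_VOLUME_UL, TOTAL_VOLUME_UL)
ligase_units = LIGASE_UNITS_PER_UL * LIGASE_VOLUME_UL
fraction_packaged = 5.0 / TOTAL_VOLUME_UL

for name, conc in final_mm.items():
    print(f"{name:16s} {conc:6.2f} mM")
print(f"T4 DNA ligase: {ligase_units:.0f} units; fraction packaged: {fraction_packaged:.2f}")
```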

A sample (200 µl) of E. coli HB101 (Boyer and Rolland-Dussoix, 1973) containing the T7 polymerase expression
plasmid pGP1-2 (Tabor and Richardson, 1985) and grown overnight in L-Broth (with maltose at 0.2% and kanamycin
at 50 µg/ml) was infected with 50 µl of the packaged DNA. Transformants were selected at 30°C on M9 (Miller, 1972)
agar containing kanamycin (50 µg/ml), chloramphenicol (25 µg/ml), L-proline (50 µg/ml), L-leucine (50 µg/ml) and B1
(5 µg/ml), and with glyphosate at 3.0 mM. Aliquot samples were also plated on the same media lacking glyphosate to
titer the packaged cosmids. Cosmid transformants were isolated on this latter medium at a rate of ~5 x 10^5 per µg
CP4 HindIII DNA after 3 days at 30°C. Colonies arose on the glyphosate agar from day 3 until day 15 with a final rate
of ~1 per 200 cosmids. DNA was prepared from 14 glyphosate tolerant clones and, following verification of this
phenotype, was transformed into E. coli GB100/pGP1-2 (E. coli GB100 is an aroA derivative of MM294 [Talmadge and
Gilbert, 1980]) and tested for complementation for growth in the absence of added aromatic amino acids and
aminobenzoic acids. Other aroA strains such as SR481 (Bachman et al., 1980; Padgette et al., 1987) could be used and
would be suitable for this experiment. The use of GB100 is merely exemplary and should not be viewed in a limiting
sense. This aroA strain usually requires that growth media be supplemented with L-phenylalanine, L-tyrosine and
L-tryptophan each at 100 µg/ml and with para-hydroxybenzoic acid, 2,3-dihydroxybenzoic acid and para-aminobenzoic
acid each at 5 µg/ml for growth in minimal media. Of the fourteen cosmids tested only one showed complementation
of the aroA- phenotype. Transformants of this cosmid, pMON17076, showed weak but uniform growth on the
unsupplemented minimal media after 10 days.

The proteins encoded by the cosmids were determined in vivo using a T7 expression system (Tabor and Richardson,
1985). Cultures of E. coli containing pGP1-2 (Tabor and Richardson, 1985) and test and control cosmids were
grown at 30°C in L-broth (2 ml) with chloramphenicol and kanamycin (25 and 50 µg/ml, respectively) to a Klett reading
of ~50. An aliquot was removed and the cells collected by centrifugation, washed with M9 salts (Miller, 1972) and
resuspended in 1 ml M9 medium containing glucose at 0.2%, thiamine at 20 µg/ml and containing the 18 amino acids
at 0.01% (minus cysteine and methionine). Following incubation at 30°C for 90 minutes, the cultures were transferred
to a 42°C water bath and held there for 15 minutes. Rifampicin (Sigma) was added to 200 µg/ml and the cultures held
at 42°C for 10 additional minutes and then transferred to 30°C for 20 minutes. Samples were pulsed with 10 µCi of
35S-methionine for 5 minutes at 30°C. The cells were collected by centrifugation and suspended in 60-120 µl cracking
buffer (60 mM Tris-HCl, pH 6.8, 1% SDS, 1% 2-mercaptoethanol, 10% glycerol, 0.01% bromophenol blue). Aliquot samples
were electrophoresed on 12.5% SDS-PAGE and following soaking for 60 minutes in 10 volumes of acetic
acid-methanol-water (10:30:60), the gel was soaked in ENLIGHTNING™ (DUPONT) following manufacturer's directions, dried,
and exposed at -70°C to X-ray film. Proteins of about 45 kd in size, labeled with 35S-methionine, were detected in
a number of the cosmids, including pMON17076.

Purification of EPSPS from Agrobacterium sp. strain CP4

All protein purification procedures were carried out at 3-5°C. EPSPS enzyme assays were performed using either
the phosphate release or radioactive HPLC method, as previously described in Padgette et al. 1987, using 1 mM
phosphoenolpyruvate (PEP, Boehringer) and 2 mM shikimate-3-phosphate (S3P) substrate concentrations. For
radioactive HPLC assays, 14C-PEP (Amersham) was utilized. S3P was synthesized as previously described in
Wibbenmeyer et al. 1988. N-terminal amino acid sequencing was performed by loading samples onto a Polybrene precycled
filter in aliquots while drying. Automated Edman degradation chemistry was used to determine the N-terminal protein
sequence, using an Applied Biosystems Model 470A gas phase sequencer (Hunkapiller et al. 1983) with an Applied
Biosystems 120A PTH analyzer.

Five 10-litre fermentations were carried out on a spontaneous "smooth" isolate of strain CP4 that displayed less
clumping when grown in liquid culture. This reduced clumping and smooth colony morphology may be due to reduced
polysaccharide production by this isolate. In the following section dealing with the purification of the EPSPS enzyme,
CP4 refers to the "smooth" isolate - CP4-S1. The cells from the three batches showing the highest specific activities
were pooled. Cell paste of Agrobacterium sp. CP4 (300 g) was washed twice with 0.5 L of 0.9% saline and collected
by centrifugation (30 minutes, 8000 rpm in a GS3 Sorvall rotor). The cell pellet was suspended in 0.9 L extraction buffer
(100 mM TrisCl, 1 mM EDTA, 1 mM BAM (benzamidine), 5 mM DTT, 10% glycerol, pH 7.5) and lysed by 2 passes
through a Manton Gaulin cell. The resulting solution was centrifuged (30 minutes, 8000 rpm) and the supernatant was
treated with 0.21 L of 1.5% protamine sulfate (in 100 mM TrisCl, pH 7.5, 0.2% w/v final protamine sulfate concentration).
After stirring for 1 hour, the mixture was centrifuged (50 minutes, 8000 rpm) and the resulting supernatant treated with
solid ammonium sulfate to 40% saturation and stirred for 1 hour. After centrifugation (50 minutes, 8000 rpm), the
resulting supernatant was treated with solid ammonium sulfate to 70% saturation, stirred for 50 minutes, and the
insoluble protein was collected by centrifugation (1 hour, 8000 rpm). This 40-70% ammonium sulfate fraction was then
dissolved in extraction buffer to give a final volume of 0.2 L, and dialyzed twice (Spectrum 10,000 MW cutoff dialysis
tubing) against 2 L of extraction buffer for a total of 12 hours.

To the resulting dialyzed 40-70% ammonium sulfate fraction (0.29 L) was added solid ammonium sulfate to give
a final concentration of 1 M. This material was loaded (2 ml/min) onto a column (5 cm x 15 cm, 295 ml) packed with
phenyl Sepharose CL-4B (Pharmacia) resin equilibrated with extraction buffer containing 1 M ammonium sulfate, and
washed with the same buffer (1.5 L, 2 ml/min). EPSPS was eluted with a linear gradient of extraction buffer going from
1 M to 0.00 M ammonium sulfate (total volume of 1.5 L, 2 ml/min). Fractions were collected (20 ml) and assayed for
EPSPS activity by the phosphate release assay. The fractions with the highest EPSPS activity (fractions 36-50) were
pooled and dialyzed against 3 x 2 L (18 hours) of 10 mM TrisCl, 25 mM KCl, 1 mM EDTA, 5 mM DTT, 10% glycerol,
pH 7.8.

The dialyzed EPSPS extract (350 ml) was loaded (5 ml/min) onto a column (2.4 cm x 30 cm, 136 ml) packed with
Q-Sepharose Fast Flow (Pharmacia) resin equilibrated with 10 mM TrisCl, 25 mM KCl, 5 mM DTT, 10% glycerol, pH
7.8 (Q Sepharose buffer), and washed with 1 L of the same buffer. EPSPS was eluted with a linear gradient of Q
Sepharose buffer going from 0.025 M to 0.40 M KCl (total volume of 1.4 L, 5 ml/min). Fractions were collected (15 ml)
and assayed for EPSPS activity by the phosphate release assay. The fractions with the highest EPSPS activity (fractions
47-60) were pooled and the protein was precipitated by adding solid ammonium sulfate to 80% saturation and stirring
for 1 hour. The precipitated protein was collected by centrifugation (20 minutes, 12000 rpm in a GSA Sorvall rotor),
dissolved in Q Sepharose buffer (total volume of 14 ml), and dialyzed against the same buffer (2 x 1 L, 18 hours).

The resulting dialyzed partially purified EPSPS extract (19 ml) was loaded (1.7 ml/min) onto a Mono Q 10/10
column (Pharmacia) equilibrated with Q Sepharose buffer, and washed with the same buffer (35 ml). EPSPS was
eluted with a linear gradient of 0.025 M to 0.35 M KCl (total volume of 119 ml, 1.7 ml/min). Fractions were collected
(1.7 ml) and assayed for EPSPS activity by the phosphate release assay. The fractions with the highest EPSPS activity
(fractions 30-37) were pooled (6 ml).

The Mono Q pool was made 1 M in ammonium sulfate by the addition of solid ammonium sulfate and 2 ml aliquots
were chromatographed on a Phenyl Superose 5/5 column (Pharmacia) equilibrated with 100 mM TrisCl, 5 mM DTT, 1
M ammonium sulfate, 10% glycerol, pH 7.5 (Phenyl Superose buffer). Samples were loaded (1 ml/min), washed with
Phenyl Superose buffer (10 ml), and eluted with a linear gradient of Phenyl Superose buffer going from 1 M to 0.00 M
ammonium sulfate (total volume of 60 ml, 1 ml/min). Fractions were collected (1 ml) and assayed for EPSPS activity
by the phosphate release assay. The fractions from each run with the highest EPSPS activity (fractions ~36-40) were
pooled together (10 ml, 2.5 mg protein). For N-terminal amino acid sequence determination, a portion of one fraction
(#39 from run 1) was dialyzed against 50 mM NaHCO3 (2 x 1 L). The resulting pure EPSPS sample (0.9 ml, 77 µg
protein) was found to exhibit a single N-terminal amino acid sequence of:

XH(G)ASSRPATARKSS(G)LX(G)(T)V(R)IPG(D)(K)(M) (SEQ ID NO:18).

In this and all amino acid sequences to follow, the standard single letter nomenclature is used. All peptide structures
represented in the following description are shown in conventional format wherein the amino group at the N-terminus
appears to the left and the carboxyl group at the C-terminus at the right. Likewise, amino acid nomenclature for the
naturally occurring amino acids found in protein is as follows: alanine (Ala;A), asparagine (Asn;N), aspartic acid (Asp;D),
arginine (Arg;R), cysteine (Cys;C), glutamic acid (Glu;E), glutamine (Gln;Q), glycine (Gly;G), histidine (His;H),
isoleucine (Ile;I), leucine (Leu;L), lysine (Lys;K), methionine (Met;M), phenylalanine (Phe;F), proline (Pro;P), serine
(Ser;S), threonine (Thr;T), tryptophan (Trp;W), tyrosine (Tyr;Y), and valine (Val;V). An "X" is used when the amino acid
residue is unknown and parentheses designate that an unambiguous assignment is not possible and the amino acid
designation within the parentheses is the most probable estimate based on known information.

The remaining Phenyl Superose EPSPS pool was dialyzed against 50 mM TrisCl, 2 mM DTT, 10 mM KCl, 10%
glycerol, pH 7.5 (2 x 1 L). An aliquot (0.55 ml, 0.61 mg protein) was loaded (1 ml/min) onto a Mono Q 5/5 column
(Pharmacia) equilibrated with Q Sepharose buffer, washed with the same buffer (5 ml), and eluted with a linear gradient
of Q Sepharose buffer going from 0-0.14 M KCl in 10 minutes, then holding at 0.14 M KCl (1 ml/min). Fractions were
collected (1 ml) and assayed for EPSPS activity by the phosphate release assay and were subjected to SDS-PAGE
(10-15%, Phast System, Pharmacia, with silver staining) to determine protein purity. Fractions exhibiting a single band
of protein by SDS-PAGE (22-25, 222 µg) were pooled and dialyzed against 100 mM ammonium bicarbonate, pH 8.1
(2 x 1 L, 9 hours).

Trypsinolysis and peptide sequencing of Agrobacterium sp. strain CP4 EPSPS

To the resulting pure Agrobacterium sp. strain CP4 EPSPS (111 µg) was added 3 µg of trypsin (Calbiochem), and
the trypsinolysis reaction was allowed to proceed for 16 hours at 37°C. The tryptic digest was then chromatographed
(1 ml/min) on a C18 reverse phase HPLC column (Vydac) as previously described in Padgette et al. 1988 for E. coli
EPSPS. For all peptide purifications, 0.1% trifluoroacetic acid (TFA, Pierce) was designated buffer "RP-A" and 0.1%
TFA in acetonitrile was buffer "RP-B". The gradient used for elution of the trypsinized Agrobacterium sp. CP4 EPSPS
was: 0-8 minutes, 0% RP-B; 8-28 minutes, 0-15% RP-B; 28-40 minutes, 15-21% RP-B; 40-68 minutes, 21-49% RP-B;
68-72 minutes, 49-75% RP-B; 72-74 minutes, 75-100% RP-B. Fractions were collected (1 ml) and, based on the
elution profile at 210 nm, at least 70 distinct peptides were produced from the trypsinized EPSPS. Fractions 40-70
were evaporated to dryness and redissolved in 150 µl each of 10% acetonitrile, 0.1% trifluoroacetic acid.
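
The elution program for the tryptic digest is a piecewise-linear gradient, and the following minimal sketch, offered as an illustration only, returns the programmed percentage of RP-B at a given time. The segment boundaries are those stated above; the function name `percent_rp_b` is hypothetical, and the sketch describes the programmed composition only, not the composition actually reaching the column after any system delay.

```python
# Programmed % RP-B versus time for the tryptic-digest gradient described above.
# Each tuple is (start_min, end_min, start_pct, end_pct), copied from the text.
GRADIENT = [
    (0, 8, 0, 0),
    (8, 28, 0, 15),
    (28, 40, 15, 21),
    (40, 68, 21, 49),
    (68, 72, 49, 75),
    (72, 74, 75, 100),
]


def percent_rp_b(t_min):
    """Return the programmed % RP-B at t_min by linear interpolation."""
    if t_min < GRADIENT[0][0] or t_min > GRADIENT[-1][1]:
        raise ValueError("time outside the programmed gradient")
    for start, end, pct0, pct1 in GRADIENT:
        if start <= t_min <= end:
            return pct0 + (pct1 - pct0) * (t_min - start) / (end - start)


if __name__ == "__main__":
    for t in (4, 18, 34, 54, 70, 74):
        print(f"{t:>3} min: {percent_rp_b(t):.1f}% RP-B")
```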

The fraction 61 peptide was further purified on the C18 column by the gradient: 0-5 minutes, 0% RP-B; 5-10
minutes, 0-38% RP-B; 10-30 minutes, 38-45% RP-B. Fractions were collected based on the UV signal at 210 nm. A large
peptide peak in fraction 24 eluted at 42% RP-B and was dried down, resuspended as described above, and
rechromatographed on the C18 column with the gradient: 0-5 minutes, 0% RP-B; 5-12 minutes, 0-38% RP-B; 12-15 minutes,
38-39% RP-B; 15-18 minutes, 39% RP-B; 18-20 minutes, 39-41% RP-B; 20-24 minutes, 41% RP-B; 24-28 minutes, 42% RP-B.
The peptide in fraction 25, eluting at 41% RP-B and designated peptide 61-24-25, was subjected to N-terminal
amino acid sequencing, and the following sequence was determined:

APSM(I)(D)EYPILAV (SEQ ID NO:19).

The CP4 EPSPS fraction 53 tryptic peptide was further purified by C18 HPLC by the gradient 0% RP-B (5 minutes), 0-30%
RP-B (5-17 minutes), 30-40% RP-B (17-37 minutes). The peptide in fraction 28, eluting at 34% RP-B and designated peptide
53-28, was subjected to N-terminal amino acid sequencing, and the following sequence was determined:

ITGLLEGEDVINTGK (SEQ ID NO:20).

In order to verify the CP4 EPSPS cosmid clone, a number of oligonucleotide probes were designed on the basis
of the sequence of two of the tryptic sequences from the CP4 enzyme (Table III). The probe identified as MID was of very
low degeneracy and was used for initial screening. The probes identified as EDV-C and EDV-T were based on the
same amino acid sequences and differ in one position (underlined in Table III below) and were used as confirmatory
probes, with a positive to be expected only from one of these two probes. In the oligonucleotides below, alternate
acceptable nucleotides at a particular position are designated by a "/" such as A/C/T.

Table III: Selected CP4 EPSPS peptide sequences and DNA probes

PEPTIDE 61-24-25 APSM(I)(D)EYPILAV (SEQ ID NO:19)
Probe MID; 17-mer, mixed probe; 24-fold degenerate
ATGATA/C/TGAC/TGAG/ATAC/TCC (SEQ ID NO:21)

PEPTIDE 53-28 ITGLLEGEDVINTGK (SEQ ID NO:20)
Probe EDV-C; 17-mer, mixed probe; 48-fold degenerate
GAA/GGAC/TGTA/C/G/TATA/C/TAACAC (SEQ ID NO:22)
Probe EDV-T; 17-mer, mixed probe; 48-fold degenerate
GAA/GGAC/TGTA/C/G/TATA/C/TAATAC (SEQ ID NO:23)
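
The stated probe lengths and degeneracies can be checked directly from the slash notation. The following is a minimal sketch, assuming each position is either a single base or a set of alternatives joined by "/"; the helper names are hypothetical, and the EDV-T probe is read here with a T at the one position where it differs from EDV-C.

```python
import re

# Probe sequences in the slash notation of Table III.
PROBES = {
    "MID": "ATGATA/C/TGAC/TGAG/ATAC/TCC",
    "EDV-C": "GAA/GGAC/TGTA/C/G/TATA/C/TAACAC",
    "EDV-T": "GAA/GGAC/TGTA/C/G/TATA/C/TAATAC",
}


def parse_probe(notation):
    """Split slash notation into a list of per-position alternatives."""
    tokens = re.findall(r"[ACGT](?:/[ACGT])*", notation)
    if "".join(tokens) != notation:
        raise ValueError("unexpected characters in probe notation")
    return [token.split("/") for token in tokens]


def degeneracy(notation):
    """Number of distinct oligonucleotides in the mixed probe."""
    count = 1
    for alternatives in parse_probe(notation):
        count *= len(alternatives)
    return count


def differing_positions(notation_a, notation_b):
    """1-based positions at which two probes of equal length differ."""
    a, b = parse_probe(notation_a), parse_probe(notation_b)
    if len(a) != len(b):
        raise ValueError("probes must have the same length")
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]


if __name__ == "__main__":
    for name, notation in PROBES.items():
        print(name, len(parse_probe(notation)), "mer,",
              degeneracy(notation), "fold degenerate")
    print("EDV-C vs EDV-T differ at:",
          differing_positions(PROBES["EDV-C"], PROBES["EDV-T"]))
```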

The probes were labeled using gamma-32P-ATP and polynucleotide kinase. DNA from fourteen of the cosmids
described above was restricted with EcoRI, transferred to membrane and probed with the oligonucleotide probes. The
conditions used were as follows: prehybridization was carried out in 6X SSC, 10X Denhardt's for 2-18 hour periods at
60°C, and hybridization was for 48-72 hours in 6X SSC, 10X Denhardt's, 100 µg/ml tRNA at 10°C below the Td for the
probe. The Td of the probe was approximated by the formula 2°C x (A+T) + 4°C x (G+C). The filters were then washed
three times with 6X SSC for ten minutes each at room temperature, dried and autoradiographed. Using the MID probe,
an ~9.9 kb fragment in the pMON17076 cosmid gave the only positive signal. This cosmid DNA was then probed with
the EDV-C (SEQ ID NO:22) and EDV-T (SEQ ID NO:23) probes separately and again this ~9.9 kb band gave a signal
and only with the EDV-T probe.
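
The Td approximation given above, 2°C x (A+T) + 4°C x (G+C), can be applied to a mixed probe as sketched below. The text does not state which member of a mixed probe the Td was calculated for, so this illustration simply reports the lowest and highest values over all members; the function names are hypothetical.

```python
import re
from itertools import product


def expand(notation):
    """List every oligonucleotide encoded by a slash-notation mixed probe."""
    tokens = re.findall(r"[ACGT](?:/[ACGT])*", notation)
    return ["".join(p) for p in product(*(t.split("/") for t in tokens))]


def td_celsius(sequence):
    """Td approximated as 2°C x (A+T) + 4°C x (G+C), as in the text."""
    at = sequence.count("A") + sequence.count("T")
    gc = sequence.count("G") + sequence.count("C")
    return 2 * at + 4 * gc


def td_range(notation):
    """Lowest and highest Td over all members of a mixed probe."""
    values = [td_celsius(seq) for seq in expand(notation)]
    return min(values), max(values)


if __name__ == "__main__":
    low, high = td_range("ATGATA/C/TGAC/TGAG/ATAC/TCC")  # probe MID
    print(f"MID Td: {low}-{high} °C")
    print(f"hybridization at 10 °C below Td: {low - 10}-{high - 10} °C")
```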

The combined data on the glyphosate tolerant phenotype, the complementation of the E. coli aroA- phenotype,
the expression of a ~45 Kd protein, and the hybridization to two probes derived from the CP4 EPSPS amino acid
sequence strongly suggested that the pMON17076 cosmid contained the EPSPS gene.

Localization and subcloning of the CP4 EPSPS gene 


The CP4 EPSPS gene was further localized as follows: a number of additional Southern analyses were carried
out on different restriction digests of pMON17076 using the MID (SEQ ID NO:21) and EDV-T (SEQ ID NO:23) probes
separately. Based on these analyses and on subsequent detailed restriction mapping of the pBlueScript (Stratagene)
subclones of the ~9.9 kb fragment from pMON17076, a 3.8 kb EcoRI-SalI fragment was identified to which both probes
hybridized. This analysis also showed that MID (SEQ ID NO:21) and EDV-T (SEQ ID NO:23) probes hybridized to
different sides of BamHI, ClaI, and SacII sites. This 3.8 kb fragment was cloned in both orientations in pBlueScript to
form pMON17081 and pMON17082. The phenotypes imparted to E. coli by these clones were then determined.
Glyphosate tolerance was determined following transformation into E. coli MM294 containing pGP1-2 (pBlueScript also
contains a T7 promoter) on M9 agar media containing glyphosate at 3 mM. Both pMON17081 and pMON17082 showed
glyphosate tolerant colonies at three days at 30°C at about half the size of the controls on the same media lacking
glyphosate. This result suggested that the 3.8 kb fragment contained an intact EPSPS gene. The apparent lack of
orientation-dependence of this phenotype could be explained by the presence of the T7 promoter at one side of the
cloning sites and the lac promoter at the other. The aroA phenotype was determined in transformants of E. coli GB100
on M9 agar media lacking aromatic supplements. In this experiment, carried out with and without the Plac inducer
IPTG, pMON17082 showed much greater growth than pMON17081, suggesting that the EPSPS gene was expressed
from the SalI site towards the EcoRI site.

Nucleotide sequencing was begun from a number of restriction site ends, including the BamHI site discussed
above. Sequences encoding protein sequences that closely matched the N-terminus protein sequence and that for
the tryptic fragment 53-28 (SEQ ID NO:20) (the basis of the EDV-T probe) (SEQ ID NO:23) were localized to the SalI
side of this BamHI site. These data provided conclusive evidence for the cloning of the CP4 EPSPS gene and for the
direction of transcription of this gene. These data coupled with the restriction mapping data also indicated that the
complete gene was located on an ~2.3 kb XhoI fragment and this fragment was subcloned into pBlueScript. The
nucleotide sequence of almost 2 kb of this fragment was determined by a combination of sequencing from cloned
restriction fragments and by the use of specific primers to extend the sequence. The nucleotide sequence of the CP4
EPSPS gene and flanking regions is shown in Figure 3 (SEQ ID NO:2). The sequence corresponding to peptide
61-24-25 (SEQ ID NO:19) was also located. The sequence was determined using both the Sequenase kit from IBI
(International Biotechnologies Inc.) and the T7 sequencing/Deaza Kit from Pharmacia.

That the cloned gene encoded the EPSPS activity purified from the Agrobacterium sp. strain CP4 was verified in
the following manner: By a series of site directed mutageneses, BglII and NcoI sites were placed at the N-terminus
with the fMet contained within the NcoI recognition sequence, the first internal NcoI site was removed (the second
internal NcoI site was removed later), and a SacI site was placed after the stop codons. At a later stage the internal
NotI site was also removed by site-directed mutagenesis. The following list includes the primers for the site-directed
mutagenesis (addition or removal of restriction sites) of the CP4 EPSPS gene. Mutagenesis was carried out by the
procedures of Kunkel et al. (1987), essentially as described in Sambrook et al. (1989).

PRIMER BgNc (addition of BglII and NcoI sites to N-terminus)

CGTGGATAGATCTAGGAAGACAACCATGGCTCACGGTC (SEQ ID NO:24)

PRIMER Sph2 (addition of SphI site to N-terminus)

GGATAGATTAAGGAAGACGCGCATGCTTCACGGTGCAAGCAGCC (SEQ ID NO:25)

PRIMER S1 (addition of SacI site immediately after stop codons)

GGCTGCCTGATGAGCTCCACAATCGCCATCGATGG (SEQ ID NO:26)

PRIMER N1 (removal of internal NotI recognition site)

CGTCGCTCGTCGTGCGTGGCCGCCCTGACGGC (SEQ ID NO:27)

PRIMER Nco1 (removal of first internal NcoI recognition site)

CGGGCAAGGCCATGCAGGCTATGGGCGCC (SEQ ID NO:28)

PRIMER Nco2 (removal of second internal NcoI recognition site)

CGGGCTGCCGCCTGACTATGGGCCTCGTCGG (SEQ ID NO:29)
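
As an illustrative check of the stated purpose of each primer, the sketch below scans the primer sequences for the standard recognition sequences of BglII (AGATCT), NcoI (CCATGG), SphI (GCATGC), SacI (GAGCTC) and NotI (GCGGCCGC). Because all five sites are palindromic, scanning one strand suffices. The dictionary and function names are hypothetical.

```python
# Standard recognition sequences of the enzymes named in the primer list.
SITES = {
    "BglII": "AGATCT",
    "NcoI": "CCATGG",
    "SphI": "GCATGC",
    "SacI": "GAGCTC",
    "NotI": "GCGGCCGC",
}

# Mutagenesis primers as listed above (SEQ ID NO:24 to NO:29).
PRIMERS = {
    "BgNc": "CGTGGATAGATCTAGGAAGACAACCATGGCTCACGGTC",
    "Sph2": "GGATAGATTAAGGAAGACGCGCATGCTTCACGGTGCAAGCAGCC",
    "S1": "GGCTGCCTGATGAGCTCCACAATCGCCATCGATGG",
    "N1": "CGTCGCTCGTCGTGCGTGGCCGCCCTGACGGC",
    "Nco1": "CGGGCAAGGCCATGCAGGCTATGGGCGCC",
    "Nco2": "CGGGCTGCCGCCTGACTATGGGCCTCGTCGG",
}


def sites_in(sequence):
    """Names of the listed enzymes whose site occurs in the sequence."""
    return sorted(name for name, site in SITES.items() if site in sequence)


if __name__ == "__main__":
    for name, sequence in PRIMERS.items():
        found = sites_in(sequence)
        print(f"{name}: {', '.join(found) if found else 'none of the listed sites'}")
```

The primers that add sites carry exactly the sites named for them, and the three removal primers carry none of the listed sites, which is consistent with their described use.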

This CP4 EPSPS gene was then cloned as a NcoI-BamHI N-terminal fragment plus a BamHI-SacI C-terminal
fragment into a PrecA-gene10L expression vector similar to those described (Wong et al., 1988; Olins et al., 1988) to
form pMON17101. The Km for PEP and the Ki for glyphosate were determined for the EPSPS activity in crude lysates
of pMON17101/GB100 transformants following induction with nalidixic acid (Wong et al., 1988) and found to be the
same as that determined for the purified and crude enzyme preparations from Agrobacterium sp. strain CP4.


Characterization of the EPSPS gene from Achromobacter sp. strain LBAA and from Pseudomonas sp. strain PG2982 

A cosmid bank of partially HindIII-restricted LBAA DNA was constructed in E. coli MM294 in the vector pHC79
(Hohn and Collins, 1980). This bank was probed with a full length CP4 EPSPS gene probe by colony hybridization and
positive clones were identified at a rate of ~1 per 400 cosmids. The LBAA EPSPS gene was further localized in these
cosmids by Southern analysis. The gene was located on an ~2.8 kb XhoI fragment and by a series of sequencing
steps, both from restriction fragment ends and by using the oligonucleotide primers from the sequencing of the CP4
EPSPS gene, the nucleotide sequence of the LBAA EPSPS gene was completed and is presented in Figure 4 (SEQ
ID NO:4).

The EPSPS gene from PG2982 was also cloned. The EPSPS protein was purified, essentially as described for
the CP4 enzyme, with the following differences: Following the phenyl Sepharose CL-4B column, the fractions with the
highest EPSPS activity were pooled and the protein precipitated by adding solid ammonium sulfate to 85% saturation and
stirring for 1 hour. The precipitated protein was collected by centrifugation, resuspended in Q Sepharose buffer and
following dialysis against the same buffer was loaded onto the column (as for the CP4 enzyme). After purification on
the Q Sepharose column, ~40 mg of protein in 100 mM Tris pH 7.8, 10% glycerol, 1 mM EDTA, 1 mM DTT, and 1 M
ammonium sulfate, was loaded onto a Phenyl Superose (Pharmacia) column. The column was eluted at 1.0 ml/minute
with a 40 ml gradient from 1.0 M to 0.00 M ammonium sulfate in the above buffer.

Approximately 1.0 mg of protein from the active fractions of the Phenyl Superose 10/10 column was loaded onto
a Pharmacia Mono P 5/10 Chromatofocusing column with a flow rate of 0.75 ml/minute. The starting buffer was 25
mM bis-Tris at pH 6.3, and the column was eluted with 39 ml of Polybuffer 74, pH 4.0. Approximately 50 µg of the peak
fraction from the Chromatofocusing column was dialyzed into 25 mM ammonium bicarbonate. This sample was then
used to determine the N-terminal amino acid sequence.
The N-terminal sequence obtained was:

XHSASPKPATARRSE (where X = an unidentified residue) (SEQ ID NO:30).

A number of degenerate oligonucleotide probes were designed based on this sequence and used to probe a library of
PG2982 partial-HindIII DNA in the cosmid pHC79 (Hohn and Collins, 1980) by colony hybridization under nonstringent
conditions. Final washing conditions were 15 minutes with 1X SSC, 0.1% SDS at 55°C. One probe with the sequence
GCGGTBGCSGGYTTSGG (where B = C, G, or T; S = C or G, and Y = C or T) (SEQ ID NO:31) identified a set of cosmid clones.


The cosmid set identified in this way was made up of cosmids of diverse HindIII fragments. However, when this
set was probed with the CP4 EPSPS gene probe, a cosmid containing the PG2982 EPSPS gene was identified
(designated as cosmid 9C1 originally and later as pMON20107). By a series of restriction mappings and Southern analysis
this gene was localized to a ~2.8 kb XhoI fragment and the nucleotide sequence of this gene was determined. This
DNA sequence (SEQ ID NO:6) is shown in Figure 5. There are no nucleotide differences between the EPSPS gene
sequences from LBAA (SEQ ID NO:4) and PG2982 (SEQ ID NO:6). The kinetic parameters of the two enzymes are
within the range of experimental error.

A gene from PG2982 that imparts glyphosate tolerance in E. coli has been sequenced (Fitzgibbon, 1988; Fitzgibbon
and Braymer, 1990). The sequence of the PG2982 EPSPS Class II gene shows no homology to the previously reported
sequence, suggesting that the glyphosate tolerant phenotype of the previous work is not related to EPSPS.

Alternative Isolation Protocols for Other Class II EPSPS Structural Genes 


A number of Class II genes have been isolated and described here. It is clear that the initial gene cloning, that of
the gene from CP4, was difficult due to the low degree of similarity between the Class I and Class II enzymes and
genes. The identification of the other genes however was greatly facilitated by the use of this first gene as a probe. In
the cloning of the LBAA EPSPS gene, the CP4 gene probe allowed the rapid identification of cosmid clones and the
localization of the intact gene to a small restriction fragment and some of the CP4 sequencing primers were also used
to sequence the LBAA (and PG2982) EPSPS gene(s). The CP4 gene probe was also used to confirm the PG2982
gene clone. The high degree of similarity of the Class II EPSPS genes may be used to identify and clone additional
genes in much the same way that Class I EPSPS gene probes have been used to clone other Class I genes. An
example of the latter was in the cloning of the A. thaliana EPSPS gene using the P. hybrida gene as a probe (Klee et
al., 1987).

Glyphosate tolerant EPSPS activity has been reported previously for EPSP synthases from a number of sources.
These enzymes have not been characterized to any extent in most cases. The use of Class I and Class II EPSPS gene
probes or antibody probes provides a rapid means of initially screening for the nature of the EPSPS and provides tools
for the rapid cloning and characterization of the genes for such enzymes.

Two of the three genes described were isolated from bacteria that were isolated from a glyphosate treatment facility
(Strains CP4 and LBAA). The third (PG2982) was from a bacterium that had been isolated from a culture collection
strain. This latter isolation suggests that exposure to glyphosate may not be a prerequisite for the isolation of high
glyphosate tolerant EPSPS enzymes and that the screening of collections of bacteria could yield additional isolates.
It is possible to enrich for glyphosate degrading or glyphosate resistant microbial populations (Quinn et al., 1988; Talbot
et al., 1984) in cases where it was felt that enrichment for such microorganisms would enhance the isolation frequency
of Class II EPSPS microorganisms. Additional bacteria containing Class II EPSPS genes have also been identified. A
bacterium called C12, isolated from the same treatment column beads as CP4 (see above) but in a medium in which
glyphosate was supplied as both the carbon and phosphorus source, was shown by Southern analysis to hybridize
with a probe consisting of the CP4 EPSPS coding sequence. This result, in conjunction with that for strain LBAA,
suggests that this enrichment method facilitates the identification of Class II EPSPS isolates. New bacterial isolates
containing Class II EPSPS genes have also been identified from environments other than glyphosate waste treatment
facilities. An inoculum was prepared by extracting soil (from a recently harvested soybean field in Jerseyville, Illinois)
and a population of bacteria selected by growth at 28°C in Dworkin-Foster medium containing glyphosate at 10 mM
as a source of carbon (and with cycloheximide at 100 µg/ml to prevent the growth of fungi). Upon plating on L-agar
media, five colony types were identified. Chromosomal DNA was prepared from 2 ml L-broth cultures of these isolates
and the presence of a Class II EPSPS gene was probed using the CP4 EPSPS coding sequence probe by Southern
analysis under stringent hybridization and washing conditions. One of the soil isolates, S2, was positive by this screen.

Relationships between different EPSPS genes 


The deduced amino acid sequences of a number of Class I and the Class II EPSPS enzymes were compared
using the Bestfit computer program provided in the UWGCG package (Devereux et al. 1984). The degree of similarity
and identity as determined using this program is reported. The degree of similarity/identity determined within Class I
and Class II protein sequences is remarkably high, for instance, comparing E. coli with S. typhimurium (similarity/identity
= 93%/88%) and even comparing E. coli with a plant EPSPS (Petunia hybrida; 72%/55%). These data are shown in Table
IV. The comparison of sequences between Class I and Class II, however, shows only a very low degree of relatedness
between the Classes (similarity/identity = 50-53%/23-30%). The display of the Bestfit analysis for the E. coli (SEQ ID
NO:8) and CP4 (SEQ ID NO:3) sequences shows the positions of the conserved residues and is presented in Figure
6. Previous analyses of EPSPS sequences had noted the high degree of conservation of sequences of the enzymes
and the almost invariance of sequences in two regions - the "20-35" and "95-107" regions (Gasser et al., 1988;
numbered according to the Petunia EPSPS sequence) - and these regions are less conserved in the case of CP4 and
LBAA when compared to Class I bacterial and plant EPSPS sequences (see Figure 6 for a comparison of the E. coli
and CP4 EPSPS sequences with the E. coli sequence appearing as the top sequence in the Figure). The corresponding
sequences in the CP4 Class II EPSPS are:

PGDKSISHRSFMFGGL (SEQ ID NO:32) and LDFGNAATGCRLT (SEQ ID NO:33).

These comparisons show that the overall relatedness of Class I and Class II EPSPS proteins is low and that
sequences in putative conserved regions have also diverged considerably.
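
For readers unfamiliar with how a percent identity figure is obtained from an alignment, the following is a minimal sketch only. It is not the Bestfit program and does not reproduce any value reported here; it simply counts matching residues over the compared columns of two already-aligned sequences. The example uses the first 15 residues of the two N-terminal sequences reported above (SEQ ID NO:18 and SEQ ID NO:30), treating the tentative (G) assignment as G and skipping the unidentified residue X; the function name is hypothetical.

```python
def percent_identity(seq_a, seq_b, skip="X-"):
    """Percent of compared columns in which two aligned sequences agree.

    Columns holding an unknown residue (X) or a gap (-) in either
    sequence are left out of both the numerator and the denominator.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a, seq_b):
        if a in skip or b in skip:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


# First 15 residues of the reported N-terminal sequences, ungapped.
CP4_NTERM = "XHGASSRPATARKSS"      # from SEQ ID NO:18, (G) taken as G
PG2982_NTERM = "XHSASPKPATARRSE"   # SEQ ID NO:30

if __name__ == "__main__":
    print(f"{percent_identity(CP4_NTERM, PG2982_NTERM):.1f}% identity "
          "over the compared N-terminal positions")
```

Alignment programs differ in how gaps and alignment length enter the denominator, so figures from a sketch of this kind are not directly comparable with those produced by Bestfit.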

In the CP4 EPSPS an alanine residue is present at the "glycine101" position. The replacement of the conserved
glycine (from the "95-107" region) by an alanine results in an elevated Ki for glyphosate and in an elevation in the
Km for PEP in Class I EPSPS. In the case of the CP4 EPSPS, which contains an alanine at this position, the Km for PEP
is in the low range, indicating that the Class II enzymes differ in many aspects from the EPSPS enzymes heretofore
characterized.

Within the Class II isolates, the degree of similarity/identity is as high as that noted for that within Class I (Table 
IV). Figure 7 displays the Bestfit computer program alignment of the CP4 (SEQ ID NO:3) and LBAA (SEQ ID NO:5) 
EPSPS deduced amino acid sequences with the CP4 sequence appearing as the top sequence in the Figure. The 
symbols used in Figures 6 and 7 are the standard symbols used in the Bestfit computer program to designate degrees 
of similarity and identity. 



Table IV

Comparison of relatedness of EPSPS protein sequences (1)

Comparison between Class I and Class II EPSPS protein sequences

                                                                    similarity   identity
E. coli vs. CP4                                                        52.8        26.3
E. coli vs. LBAA                                                       52.1        26.7
S. typhimurium vs. CP4                                                 51.8        25.8
B. pertussis vs. CP4                                                   52.8        27.3
S. cerevisiae vs. CP4                                                  53.5        29.9
P. hybrida vs. CP4                                                     50.2        23.4

Comparison between Class I EPSPS protein sequences

                                                                    similarity   identity
E. coli vs. S. typhimurium                                             93.0        88.3
P. hybrida vs. E. coli                                                 71.9        54.5

Comparison between Class II EPSPS protein sequences

                                                                    similarity   identity
Agrobacterium sp. strain CP4 vs. Achromobacter sp. strain LBAA         89.9        83.7

(1) The EPSPS sequences compared here were obtained from the following references: E. coli, Rogers et al., 1983;
S. typhimurium, Stalker et al., 1985; Petunia hybrida, Shah et al., 1986; B. pertussis, Maskell et al., 1988; and
S. cerevisiae, Duncan et al., 1987.



One difference that may be noted between the deduced amino acid sequences of the CP4 and LBAA EPSPS
proteins is at position 100 where an Alanine is found in the case of the CP4 enzyme and a Glycine is found in the case
of the LBAA enzyme. In the Class I EPSPS enzymes a Glycine is usually found in the equivalent position, i.e., Glycine96
in E. coli and K. pneumoniae and Glycine101 in Petunia. In the case of these three enzymes it has been reported that
converting that Glycine to an Alanine results in an elevation of the appKi for glyphosate and a concomitant elevation
in the appKm for PEP (Kishore et al., 1986; Kishore and Shah, 1988; Sost and Amrhein, 1990), which, as discussed
above, makes the enzyme less efficient, especially under conditions of lower PEP concentrations. The Glycine100 of
the LBAA EPSPS was converted to an Alanine and both the appKm for PEP and the appKi for glyphosate were
determined for the variant. The Glycine100 to Alanine change was introduced by mutagenesis using the following primer:

CGGCAATGCCGCCACCGGCGCGCGCC (SEQ ID NO:34) 

and both the wild type and variant genes were expressed in E. coli in a RecA promoter expression vector (pMON17201
and pMON17264, respectively) and the appKm's and appKi's determined in crude lysates. The data indicate that the
appKi(glyphosate) for the G100A variant is elevated about 16-fold (Table V). This result is in agreement with the
observation of the importance of this G-A change in raising the appKi(glyphosate) in the Class I EPSPS enzymes.
However, in contrast to the results in the Class I G-A variants, the appKm(PEP) in the Class II (LBAA) G-A variant is
unaltered. This provides yet another distinction between the Class II and Class I EPSPS enzymes.

Table V

Lysate prepared from:                    appKm(PEP)@    appKi(glyphosate)
E. coli/pMON17201 (wild type)              5.3 µM           28 µM*
E. coli/pMON17264 (G100A variant)          5.5 µM          459 µM#

@ range of PEP: 2-40 µM
* range of glyphosate: 0-310 µM; # range of glyphosate: 0-5000 µM.
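The "about 16-fold" figure can be checked directly from the Table V values. The short sketch below is only an illustrative arithmetic check; the function name `fold_change` is a hypothetical helper and not part of the described method.

```python
# Illustrative check of the Table V fold changes (values in µM, taken from Table V).
# `fold_change` is a hypothetical helper introduced only for this sketch.

def fold_change(variant: float, wild_type: float) -> float:
    """Ratio of the variant value to the wild-type value."""
    return variant / wild_type

ki_fold = fold_change(459.0, 28.0)   # appKi(glyphosate): G100A vs. wild type
km_fold = fold_change(5.5, 5.3)      # appKm(PEP): G100A vs. wild type

print(f"appKi(glyphosate) fold change: {ki_fold:.1f}")
print(f"appKm(PEP) fold change: {km_fold:.2f}")
```

The first ratio is consistent with the roughly 16-fold elevation stated in the text, while the second is close to 1, in line with the appKm(PEP) being described as unaltered.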



The LBAA G100A variant, by virtue of its superior kinetic properties, is capable of imparting improved glyphosate
tolerance in planta.

Modification and Resynthesis of the Agrobacterium sp. strain CP4 EPSPS Gene Sequence 

The EPSPS gene from Agrobacterium sp. strain CP4 contains sequences that could be inimical to high expression
of the gene in plants. These sequences include potential polyadenylation sites that are often A+T rich, a higher
G+C% than that frequently found in plant genes (63% versus ~50%), concentrated stretches of G and C residues,
and codons that are not used frequently in plant genes. The high G+C% in the CP4 EPSPS gene has a number of
potential consequences including the following: a higher usage of G or C than that found in plant genes in the third
position in codons, and the potential to form strong hairpin structures that may affect expression or stability of the
RNA. The reduction in the G+C content of the CP4 EPSPS gene, the disruption of stretches of G's and C's, the
elimination of potential polyadenylation sequences, and improvements in the codon usage to that used more frequently in
plant genes, could result in higher expression of the CP4 EPSPS gene in plants.
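The sequence features listed above (overall G+C%, stretches of G and C residues, and A+T rich candidate polyadenylation motifs) can be screened for with a few lines of code. The following is a minimal sketch under stated assumptions: the toy sequence is invented for illustration and is not the CP4 EPSPS gene, the motif AATAAA is used only as one commonly cited example of a polyadenylation signal, and all function names are hypothetical.

```python
import re
from typing import List, Tuple

def gc_percent(seq: str) -> float:
    """Percentage of G and C residues in a DNA sequence."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)

def gc_runs(seq: str, min_len: int = 5) -> List[Tuple[int, str]]:
    """Uninterrupted stretches of G/C residues at least min_len long (0-based start, run)."""
    pattern = re.compile("[GC]{%d,}" % min_len)
    return [(m.start(), m.group()) for m in pattern.finditer(seq.upper())]

def motif_positions(seq: str, motif: str = "AATAAA") -> List[int]:
    """0-based start positions of a candidate motif (overlapping matches included)."""
    return [m.start() for m in re.finditer("(?=%s)" % motif, seq.upper())]

# Invented toy sequence, NOT taken from the CP4 EPSPS gene.
toy = "ATGGCGCGCCGGTTAATAAACCGGGGGCAT"

print(gc_percent(toy))        # overall G+C%
print(gc_runs(toy))           # G/C stretches of 5 or more
print(motif_positions(toy))   # candidate polyadenylation motif positions
```

A redesign along the lines described in the text would then alter codons (without changing the encoded amino acids) until such a scan reports fewer long G/C runs and fewer A+T rich motifs.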

A synthetic CP4 gene was designed to change as completely as possible those inimical sequences discussed
above. In summary, the gene sequence was redesigned to eliminate as much as possible the following sequences or
sequence features (while avoiding the introduction of unnecessary restriction sites): stretches of G's and C's of 5 or
greater; and A+T rich regions (predominantly) that could function as polyadenylation sites or potential RNA
destabilization regions. The sequence of this gene is shown in Figure 8 (SEQ ID NO:9). This coding sequence was expressed
in E. coli from the RecA promoter and assayed for EPSPS activity and compared with that from the native CP4 EPSPS
gene. The apparent Km for PEP for the native and synthetic genes was 11.8 and 12.7, respectively, indicating that the
enzyme expressed from the synthetic gene was unaltered. The N-terminus of the coding sequence was mutagenized
to place an SphI site at the ATG to permit the construction of the CTP2-CP4 synthetic fusion for chloroplast import.
The following primer was used to accomplish this mutagenesis:

GGACGGCTGCTTGCACCGTGAAGCATGCTTAAGCTTGGCGTAATCATGG (SEQ ID NO:35)

Expression of Chloroplast Directed CP4 EPSPS 

The glyphosate target in plants, the 5-enolpyruvyl-shikimate-3-phosphate synthase (EPSPS) enzyme, is located
in the chloroplast. Many chloroplast-localized proteins, including EPSPS, are expressed from nuclear genes as
precursors and are targeted to the chloroplast by a chloroplast transit peptide (CTP) that is removed during the import
steps. Examples of other such chloroplast proteins include the small subunit (SSU) of Ribulose-1,5-bisphosphate
carboxylase (RUBISCO), Ferredoxin, Ferredoxin oxidoreductase, the Light-harvesting-complex protein I and protein II,
and Thioredoxin F. It has been demonstrated in vivo and in vitro that non-chloroplast proteins may be targeted to the
chloroplast by use of protein fusions with a CTP and that a CTP sequence is sufficient to target a protein to the
chloroplast.

A CTP-CP4 EPSPS fusion was constructed between the Arabidopsis thaliana EPSPS CTP (Klee et al., 1987) and
the CP4 EPSPS coding sequences. The Arabidopsis CTP was engineered by site-directed mutagenesis to place a
SphI restriction site at the CTP processing site. This mutagenesis replaced the Glu-Lys at this location with Cys-Met.
The sequence of this CTP, designated as CTP2 (SEQ ID NO:10), is shown in Figure 9. The N-terminus of the CP4
EPSPS gene was modified to place a SphI site that spans the Met codon. The second codon was converted to one
for leucine in this step also. This change had no apparent effect on the in vivo activity of CP4 EPSPS in E. coli as
judged by rate of complementation of the aroA allele. This modified N-terminus was then combined with the SacI
C-terminus and cloned downstream of the CTP2 sequences. The CTP2-CP4 EPSPS fusion was cloned into pBlueScript
KS(+). This vector may be transcribed in vitro using the T7 polymerase and the RNA translated with 35S-methionine
to provide material that may be evaluated for import into chloroplasts isolated from Lactuca sativa using the methods
described hereinafter (della-Cioppa et al., 1986, 1987). This template was transcribed in vitro using T7 polymerase
and the 35S-methionine-labeled CTP2-CP4 EPSPS material was shown to import into chloroplasts with an efficiency
comparable to that for the control Petunia EPSPS (control = 35S-labeled PreEPSPS [pMON6140; della-Cioppa et al.,
1986]).

In another example the Arabidopsis EPSPS CTP, designated as CTP3, was fused to the CP4 EPSPS through an
EcoRI site. The sequence of this CTP3 (SEQ ID NO:12) is shown in Figure 10. An EcoRI site was introduced into the
Arabidopsis EPSPS mature region around amino acid 27, replacing the sequence -Arg-Ala-Leu-Leu- with
-Arg-Ile-Leu-Leu- in the process. The primer of the following sequence was used to modify the N-terminus of the CP4
EPSPS gene to add an EcoRI site to effect the fusion to the CTP3:

GGAAGACGCCCAGAATTCACGGTGCAAGCAGCCGG

(SEQ ID NO:36) (the EcoRI site is GAATTC).

This CTP3-CP4 EPSPS fusion was also cloned into the pBlueScript vector and the T7 expressed fusion was found to
also import into chloroplasts with an efficiency comparable to that for the control Petunia EPSPS (pMON6140).

A related series of CTPs, designated as CTP4 (SphI) and CTP5 (EcoRI), based on the Petunia EPSPS CTP and
gene were also fused to the SphI- and EcoRI-modified CP4 EPSPS gene sequences. The SphI site was added by
site-directed mutagenesis to place this restriction site (and change the amino acid sequence to -Cys-Met-) at the
chloroplast processing site. All of the CTP-CP4 EPSPS fusions were shown to import into chloroplasts with approximately
equal efficiency. The CTP4 (SEQ ID NO:14) and CTP5 (SEQ ID NO:16) sequences are shown in Figures 11 and 12.

A CTP2-LBAA EPSPS fusion was also constructed following the modification of the N-terminus of the LBAA EPSPS
gene by the addition of a SphI site. This fusion was also found to be imported efficiently into chloroplasts.

By similar approaches, the CTP2-CP4 EPSPS and the CTP4-CP4 EPSPS fusion have also been shown to import
efficiently into chloroplasts prepared from the leaf sheaths of corn. These results indicate that these CTP-CP4 fusions
could also provide useful genes to impart glyphosate tolerance in monocot species.

Those skilled in the art will recognize that various chimeric constructs can be made which utilize the functionality 
of a particular CTP to import a Class II EPSPS enzyme into the plant cell chloroplast. The chloroplast import of the 
Class II EPSPS can be determined using the following assay. 


Chloroplast Uptake Assay 

Intact chloroplasts are isolated from lettuce (Lactuca sativa, var. longifolia) by centrifugation in Percoll/ficoll gradients
as modified from Bartlett et al. (1982). The final pellet of intact chloroplasts is suspended in 0.5 ml of sterile 330 mM
sorbitol in 50 mM Hepes-KOH, pH 7.7, assayed for chlorophyll (Arnon, 1949), and adjusted to the final chlorophyll
concentration of 4 mg/ml (using sorbitol/Hepes). The yield of intact chloroplasts from a single head of lettuce is 3-6 mg
chlorophyll.

A typical 300 µl uptake experiment contained 5 mM ATP, 8.3 mM unlabeled methionine, 322 mM sorbitol, 58.3 mM
Hepes-KOH (pH 8.0), 50 µl reticulocyte lysate translation products, and intact chloroplasts from L. sativa (200 µg
chlorophyll). The uptake mixture is gently rocked at room temperature (in 10 x 75 mm glass tubes) directly in front of
a fiber optic illuminator set at maximum light intensity (150 Watt bulb). Aliquot samples of the uptake mix (about 50 µl)
are removed at various times and fractionated over 100 µl silicone-oil gradients (in 150 µl polyethylene tubes) by
centrifugation at 11,000 X g for 30 seconds. Under these conditions, the intact chloroplasts form a pellet under the
silicone-oil layer and the incubation medium (containing the reticulocyte lysate) floats on the surface. After
centrifugation, the silicone-oil gradients are immediately frozen in dry ice. The chloroplast pellet is then resuspended in
50-100 µl of lysis buffer (10 mM Hepes-KOH pH 7.5, 1 mM PMSF, 1 mM benzamidine, 5 mM ε-amino-n-caproic acid, and 30
µg/ml aprotinin) and centrifuged at 15,000 X g for 20 minutes to pellet the thylakoid membranes. The clear supernatant
(stromal proteins) from this spin, and an aliquot of the reticulocyte lysate incubation medium from each uptake
experiment, are mixed with an equal volume of 2X SDS-PAGE sample buffer for electrophoresis (Laemmli, 1970).
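As a quick planning aid, the quantities given above imply how much of the 4 mg/ml chloroplast suspension goes into each 300 µl uptake experiment and roughly how many experiments one head of lettuce can supply. The sketch below is only arithmetic on the stated figures; the function names are hypothetical, and losses during handling are ignored (an assumption).

```python
# Arithmetic sketch based on the stated figures; function names are hypothetical.

def suspension_volume_ul(chlorophyll_ug: float, stock_mg_per_ml: float = 4.0) -> float:
    """Volume (µl) of chloroplast suspension that contains the requested chlorophyll.

    1 mg/ml is the same concentration as 1 µg/µl, so the division gives µl directly.
    """
    return chlorophyll_ug / stock_mg_per_ml

def experiments_per_yield(yield_mg: float, chlorophyll_ug_per_experiment: float = 200.0) -> int:
    """Whole number of uptake experiments supported by a given chlorophyll yield."""
    return int(yield_mg * 1000.0 // chlorophyll_ug_per_experiment)

volume = suspension_volume_ul(200.0)   # 200 µg chlorophyll per experiment
low = experiments_per_yield(3.0)       # lower end of the 3-6 mg yield
high = experiments_per_yield(6.0)      # upper end of the 3-6 mg yield

print(volume, low, high)
```

Under these assumptions each experiment uses about one sixth of its 300 µl volume as chloroplast suspension.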

SDS-PAGE is carried out according to Laemmli (1970) in 3-17% (w/v) acrylamide slab gels (60 mm X 1.5 mm)
with 3% (w/v) acrylamide stacking gels (5 mm X 1.5 mm). The gel is fixed for 20-30 min in a solution with 40% methanol
and 10% acetic acid. Then, the gel is soaked in EN3HANCE™ (DuPont) for 20-30 minutes, followed by drying the gel
on a gel dryer. The gel is imaged by autoradiography, using an intensifying screen and an overnight exposure to
determine whether the CP4 EPSPS is imported into the isolated chloroplasts.

PLANT TRANSFORMATION 

Plants which can be made glyphosate tolerant by practice of the present invention include, but are not limited to,
soybean, cotton, corn, canola, oil seed rape, flax, sugarbeet, sunflower, potato, tobacco, tomato, wheat, rice, alfalfa
and lettuce as well as various tree, nut and vine species.

A double-stranded DNA molecule of the present invention ("chimeric gene") can be inserted into the genome of a
plant by any suitable method. Suitable plant transformation vectors include those derived from a Ti plasmid of
Agrobacterium tumefaciens, as well as those disclosed, e.g., by Herrera-Estrella (1983), Bevan (1984), Klee (1985) and
EPO publication 120,516 (Schilperoort et al.). In addition to plant transformation vectors derived from the Ti or
root-inducing (Ri) plasmids of Agrobacterium, alternative methods can be used to insert the DNA constructs of this invention
into plant cells. Such methods may involve, for example, the use of liposomes, electroporation, chemicals that increase
free DNA uptake, free DNA delivery via microprojectile bombardment, and transformation using viruses or pollen.

Class II EPSPS Plant transformation vectors 

Class II EPSPS DNA sequences may be engineered into vectors capable of transforming plants by using known
techniques. The following description is meant to be illustrative and not to be read in a limiting sense. One of ordinary
skill in the art would know that other plasmids, vectors, markers, promoters, etc. could be used with suitable results.
The CTP2-CP4 EPSPS fusion was cloned as a BglII-EcoRI fragment into the plant vector pMON979 (described below)
to form pMON17110, a map of which is presented in Figure 13. In this vector the CP4 gene is expressed from the
enhanced CaMV35S promoter (E35S; Kay et al., 1987). A FMV35S promoter construct (pMON17116) was completed
in the following way: The SalI-NotI and the NotI-BglII fragments from pMON979 containing the Spc/AAC(3)-III/oriV and
the pBR322/Right Border/NOS 3'/CP4 EPSPS gene segment from pMON17110 were ligated with the XhoI-BglII
FMV35S promoter fragment from pMON981. These vectors were introduced into tobacco, cotton and canola.

A series of vectors was also completed in the vector pMON977 in which the CP4 EPSPS gene, the CTP2-CP4
EPSPS fusion, and the CTP3-CP4 fusion were cloned as BglII-SacI fragments to form pMON17124, pMON17119, and
pMON17120, respectively. These plasmids were introduced into tobacco. A pMON977 derivative containing the
CTP2-LBAA EPSPS gene was also completed (pMON17206) and introduced into tobacco.

The pMON979 plant transformation/expression vector was derived from pMON886 (described below) by replacing
the neomycin phosphotransferase type II (KAN) gene in pMON886 with the 0.89 kb fragment containing the bacterial
gentamicin-3-N-acetyltransferase type III (AAC(3)-III) gene (Hayford et al., 1988). The chimeric P-35S/AAC(3)-III/NOS
3' gene encodes gentamicin resistance which permits selection of transformed plant cells. pMON979 also contains a
0.95 kb expression cassette consisting of the enhanced CaMV 35S promoter (Kay et al., 1987), several unique
restriction sites, and the NOS 3' end (P-En-CaMV35S/NOS 3'). The rest of the pMON979 DNA segments are exactly the
same as in pMON886.

Plasmid pMON886 is made up of the following segments of DNA. The first is a 0.93 kb AvaI to engineered-EcoRV
fragment isolated from transposon Tn7 that encodes bacterial spectinomycin/streptomycin resistance (Spc/Str), which
is a determinant for selection in E. coli and Agrobacterium tumefaciens. This is joined to the 1.61 kb segment of DNA
encoding a chimeric kanamycin resistance which permits selection of transformed plant cells. The chimeric gene
(P-35S/KAN/NOS 3') consists of the cauliflower mosaic virus (CaMV) 35S promoter, the neomycin phosphotransferase
type II (KAN) gene, and the 3'-nontranslated region of the nopaline synthase gene (NOS 3') (Fraley et al., 1983). The
next segment is the 0.75 kb oriV containing the origin of replication from the RK2 plasmid. It is joined to the 3.1 kb SalI
to PvuI segment of pBR322 (ori322) which provides the origin of replication for maintenance in E. coli and the bom
site for the conjugational transfer into the Agrobacterium tumefaciens cells. The next segment is the 0.36 kb PvuI to
BclI fragment from pTiT37 that carries the nopaline-type T-DNA right border (Fraley et al., 1985).

The pMON977 vector is the same as pMON981 except for the presence of the P-En-CaMV35S promoter in place 
of the FMV35S promoter (see below). 

The pMON981 plasmid contains the following DNA segments: the 0.93 kb fragment isolated from transposon Tn7
encoding bacterial spectinomycin/streptomycin resistance [Spc/Str; a determinant for selection in E. coli and
Agrobacterium tumefaciens (Fling et al., 1985)]; the chimeric kanamycin resistance gene engineered for plant expression to
allow selection of the transformed tissue, consisting of the 0.35 kb cauliflower mosaic virus 35S promoter (P-35S)
(Odell et al., 1985), the 0.83 kb neomycin phosphotransferase type II gene (KAN), and the 0.26 kb 3'-nontranslated
region of the nopaline synthase gene (NOS 3') (Fraley et al., 1983); the 0.75 kb origin of replication from the RK2
plasmid (oriV) (Stalker et al., 1981); the 3.1 kb SalI to PvuI segment of pBR322 which provides the origin of replication
for maintenance in E. coli (ori-322) and the bom site for the conjugational transfer into the Agrobacterium tumefaciens
cells; and the 0.36 kb PvuI to BclI fragment from the pTiT37 plasmid containing the nopaline-type T-DNA right border
region (Fraley et al., 1985). The expression cassette consists of the 0.6 kb 35S promoter from the figwort mosaic virus
(P-FMV35S) (Gowda et al., 1989) and the 0.7 kb 3' non-translated region of the pea rbcS-E9 gene (E9 3') (Coruzzi et
al., 1984, and Morelli et al., 1985). The 0.6 kb SspI fragment containing the FMV35S promoter (Figure 1) was engineered
to place suitable cloning sites downstream of the transcriptional start site. The CTP2-CP4syn gene fusion was
introduced into plant expression vectors (including pMON981, to form pMON17131; Figure 14) and transformed into
tobacco, canola, potato, tomato, sugarbeet, cotton, lettuce, cucumber, oil seed rape, poplar, and Arabidopsis.

The plant vector containing the Class II EPSPS gene may be mobilized into any suitable Agrobacterium strain for
transformation of the desired plant species. The plant vector may be mobilized into an ABI Agrobacterium strain. A
suitable ABI strain is the A208 Agrobacterium tumefaciens carrying the disarmed Ti plasmid pTiC58 (pMP90RK) (Koncz
and Schell, 1986). The Ti plasmid does not carry the T-DNA phytohormone genes and the strain is therefore unable
to cause the crown gall disease. Mating of the plant vector into ABI was done by the triparental conjugation system
using the helper plasmid pRK2013 (Ditta et al., 1980). When the plant tissue is incubated with the ABI::plant vector
conjugate, the vector is transferred to the plant cells by the vir functions encoded by the disarmed pTiC58 plasmid.
The vector opens at the T-DNA right border region, and the entire plant vector sequence may be inserted into the host
plant chromosome. The pTiC58 Ti plasmid does not transfer to the plant cells but remains in the Agrobacterium.

Class II EPSPS free DNA vectors 

Class II EPSPS genes may also be introduced into plants through direct delivery methods. A number of direct
delivery vectors were completed for the CP4 EPSPS gene. The vector pMON13640, a map of which is presented in
Figure 15, is described here. The plasmid vector is based on a pUC plasmid (Vieira and Messing, 1987) containing,
in this case, the nptII gene (kanamycin resistance; KAN) from Tn903 to provide a selectable marker in E. coli. The
CTP4-EPSPS gene fusion is expressed from the P-FMV35S promoter and contains the NOS 3' polyadenylation
sequence fragment and from a second cassette consisting of the E35S promoter, the CTP4-CP4 gene fusion and the
NOS 3' sequences. The scoreable GUS marker gene (Jefferson et al., 1987) is expressed from the mannopine synthase
promoter (P-MAS; Velten et al., 1984) and the soybean 7S storage protein gene 3' sequences (Schuler et al., 1982).
Similar plasmids could also be made in which CTP-CP4 EPSPS fusions are expressed from the enhanced CaMV35S
promoter or other plant promoters. Other vectors could be made that are suitable for free DNA delivery into plants and
such are within the skill of the art and contemplated to be within the scope of this disclosure.

PLANT REGENERATION 

When expression of the Class II EPSPS gene is achieved in transformed cells (or protoplasts), the cells (or
protoplasts) are regenerated into whole plants. Choice of methodology for the regeneration step is not critical, with suitable
protocols being available for hosts from Leguminosae (alfalfa, soybean, clover, etc.), Umbelliferae (carrot, celery,
parsnip), Cruciferae (cabbage, radish, rapeseed, etc.), Cucurbitaceae (melons and cucumber), Gramineae (wheat, rice,
corn, etc.), Solanaceae (potato, tobacco, tomato, peppers), various floral crops as well as various trees such as poplar
or apple, nut crops or vine plants such as grapes. See, e.g., Ammirato, 1984; Shimamoto, 1989; Fromm, 1990; Vasil,
1990.

The following examples are provided to better elucidate the practice of the present invention and should not be
interpreted in any way to limit the scope of the present invention. Those skilled in the art will recognize that various
modifications, truncations, etc. can be made to the methods and genes described herein while not departing from the
spirit and scope of the present invention.

In the examples that follow, EPSPS activity in plants is assayed by the following method. Tissue samples were
collected and immediately frozen in liquid nitrogen. One gram of young leaf tissue was frozen in a mortar with liquid
nitrogen and ground to a fine powder with a pestle. The powder was then transferred to a second mortar, extraction
buffer was added (1 ml/gram), and the sample was ground for an additional 45 seconds. The extraction buffer for
Canola consists of 100 mM Tris, 1 mM EDTA, 10% glycerol, 5 mM DTT, 1 mM BAM, 5 mM ascorbate, 1.0 mg/ml BSA,
pH 7.5 (4°C). The extraction buffer for tobacco consists of 100 mM Tris, 10 mM EDTA, 35 mM KCl, 20% glycerol, 5
mM DTT, 1 mM BAM, 5 mM ascorbate, 1.0 mg/ml BSA, pH 7.5 (4°C). The mixture was transferred to a microfuge tube
and centrifuged for 5 minutes. The resulting supernatants were desalted on spin G-50 (Pharmacia) columns, previously
equilibrated with extraction buffer (without BSA), in 0.25 ml aliquots. The desalted extracts were assayed for EPSP
synthase activity by radioactive HPLC assay. Protein concentrations in samples were determined by the BioRad
microprotein assay with BSA as the standard.

Protein concentrations were determined using the BioRad Microprotein method. BSA was used to generate a
standard curve ranging from 2-24 µg. Either 800 µl of standard or diluted sample was mixed with 200 µl of concentrated
BioRad Bradford reagent. The samples were vortexed and read at A(595) after ~5 minutes and compared to the
standard curve.
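Reading a sample against the BSA standard curve amounts to fitting the standards and inverting the fit. The sketch below assumes a simple linear least-squares fit over the 2-24 µg range, which is an assumption and not something the text specifies; the absorbance readings are invented placeholder numbers, and the function names are hypothetical.

```python
# Minimal standard-curve sketch. The A(595) readings are invented placeholders,
# and a straight-line fit is an assumption, not a stated part of the method.

def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def protein_ug(a595, slope, intercept):
    """Invert the fitted line to estimate µg protein from an A(595) reading."""
    return (a595 - intercept) / slope

bsa_ug = [2.0, 4.0, 8.0, 16.0, 24.0]        # standards within the 2-24 µg range
a595 = [0.11, 0.17, 0.29, 0.53, 0.77]       # hypothetical absorbance readings

slope, intercept = fit_line(bsa_ug, a595)
sample_ug = protein_ug(0.41, slope, intercept)   # hypothetical sample reading
print(round(slope, 4), round(intercept, 4), round(sample_ug, 2))
```

Samples whose readings fall outside the range spanned by the standards would be diluted and re-read rather than extrapolated.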

EPSPS enzyme assays contained HEPES (50 mM), shikimate-3-phosphate (2 mM), NH4 molybdate (0.1 mM) and
KF (5 mM), with or without glyphosate (0.5 or 1.0 mM). The assay mix (30 µl) and plant extract (10 µl) were preincubated
for 1 minute at 25°C and the reactions were initiated by adding 14C-PEP (1 mM). The reactions were quenched after
3 minutes with 50 µl of 90% EtOH/0.1 M HOAc, pH 4.5. The samples were spun at 6000 rpm and the resulting
supernatants were analyzed for 14C-EPSP production by HPLC. Percent resistant EPSPS is calculated from the EPSPS
activities with and without glyphosate.

The percent conversion of 14C-labeled PEP to 14C-EPSP was determined by HPLC radioassay using a C18 guard
column (Brownlee) and an AX100 HPLC column (0.4 X 25 cm, Synchropak) with 0.28 M isocratic potassium phosphate
eluant, pH 6.5, at 1 ml/min. Initial velocities were calculated by multiplying fractional turnover per unit time by the initial
concentration of the labeled substrate (1 mM). The assay was linear with time up to ~3 minutes and 30% turnover to
EPSP. Samples were diluted with 10 mM Tris, 10% glycerol, 10 mM DTT, pH 7.5 (4°C) if necessary to obtain results
within the linear range.
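The velocity calculation described above (fractional turnover per unit time multiplied by the initial substrate concentration) can be written out as follows. This is a sketch under stated assumptions: the turnover fractions are invented example numbers, "percent resistant EPSPS" is taken here to be the ratio of activity with glyphosate to activity without glyphosate expressed as a percentage (the text does not spell out the formula), and the function names are hypothetical.

```python
# Sketch of the velocity arithmetic; example turnover values are invented, and the
# percent-resistant formula is an assumed reading of "activities with and without
# glyphosate".

def initial_velocity(fraction_converted: float, minutes: float,
                     substrate_mM: float = 1.0) -> float:
    """Initial velocity in mM/min: fractional turnover per minute times [PEP]."""
    return fraction_converted / minutes * substrate_mM

def percent_resistant(v_with_glyphosate: float, v_without_glyphosate: float) -> float:
    """Activity remaining in the presence of glyphosate, as a percentage."""
    return 100.0 * v_with_glyphosate / v_without_glyphosate

def within_linear_range(fraction_converted: float, minutes: float) -> bool:
    """True if the reaction stayed within the stated limits (~3 min, 30% turnover)."""
    return minutes <= 3.0 and fraction_converted <= 0.30

v_minus = initial_velocity(0.24, 3.0)   # hypothetical: 24% turnover without glyphosate
v_plus = initial_velocity(0.06, 3.0)    # hypothetical: 6% turnover with glyphosate
print(v_minus, v_plus, percent_resistant(v_plus, v_minus))
```

A sample that fails `within_linear_range` would be diluted, as the text notes, and assayed again before its velocity is used.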

In these assays DL-dithiothreitol (DTT), benzamidine (BAM), and bovine serum albumin (BSA, essentially globulin
free) were obtained from Sigma. Phosphoenolpyruvate (PEP) was from Boehringer Mannheim and
phosphoenol-[1-14C]pyruvate (28 mCi/mmol) was from Amersham.

EXAMPLE 1 

Transformed tobacco plants have been generated with a number of the Class II EPSPS gene vectors containing
the CP4 EPSPS DNA sequence as described above with suitable expression of the EPSPS. These transformed plants
exhibit glyphosate tolerance imparted by the Class II CP4 EPSPS.

Transformation of tobacco employs the tobacco leaf disc transformation protocol which utilizes healthy leaf tissue
about 1 month old. After a 15-20 minute surface sterilization with 10% Clorox plus a surfactant, the leaves are rinsed
3 times in sterile water. Using a sterile paper punch, leaf discs are punched and placed upside down on MS104 media
(MS salts 4.3 g/l, sucrose 30 g/l, B5 vitamins 500X 2 ml/l, NAA 0.1 mg/l, and BA 1.0 mg/l) for a 1 day preculture.

The discs are then inoculated with an overnight culture of a disarmed Agrobacterium ABI strain containing the
subject vector that had been diluted 1/5 (i.e., about 0.6 OD). The inoculation is done by placing the discs in centrifuge
tubes with the culture. After 30 to 60 seconds, the liquid is drained off and the discs are blotted between sterile filter
paper. The discs are then placed upside down on MS104 feeder plates with a filter disc to co-culture.

After 2-3 days of co-culture, the discs are transferred, still upside down, to selection plates with MS104 media.
After 2-3 weeks, callus tissue forms, and individual clumps are separated from the leaf discs. Shoots are cleanly cut
from the callus when they are large enough to be distinguished from stems. The shoots are placed on hormone-free
rooting media (MSO: MS salts 4.3 g/l, sucrose 30 g/l, and B5 vitamins 500X 2 ml/l) with selection for the appropriate
antibiotic resistance. Root formation occurs in 1-2 weeks. Any leaf callus assays are preferably done on rooted
shoots while still sterile. Rooted shoots are then placed in soil and kept in a high humidity environment (i.e., plastic
containers or bags). The shoots are hardened off by gradually exposing them to ambient humidity conditions.

Expression of CP4 EPSPS protein in transformed plants 

Tobacco cells were transformed with a number of plant vectors containing the native CP4 EPSPS gene, and using
different promoters and/or CTP's. Preliminary evidence for expression of the gene was given by the ability of the leaf
tissue from antibiotic selected transformed shoots to recallus on glyphosate. In some cases, glyphosate tolerant callus
was selected directly following transformation. The level of expression of the CP4 EPSPS was determined by the level
of glyphosate tolerant EPSPS activity (assayed in the presence of 0.5 mM glyphosate) or by Western blot analysis
using a goat anti-CP4 EPSPS antibody. The Western blots were quantitated by densitometer tracing and comparison
to a standard curve established using purified CP4 EPSPS. These data are presented as % soluble leaf protein. The
data from a number of transformed plant lines and transformation vectors are presented in Table VI below.



Table VI

Expression of CP4 EPSPS in transformed tobacco tissue

Vector           Plant #      CP4 EPSPS* (% leaf protein)
pMON17110        25313        0.02
pMON17110        25329        0.04
pMON17116        25095        0.02
pMON17119        25106        0.09
pMON17119        25762        0.09
pMON17119        25767        0.03

* Glyphosate tolerant EPSPS activity was also demonstrated in leaf extracts for these plants.



Glyphosate tolerance has also been demonstrated at the whole plant level in transformed tobacco plants. In
tobacco, R0 transformants of CTP2-CP4 EPSPS were sprayed at 0.4 lb/acre (0.448 kg/hectare), a rate sufficient to kill
control non-transformed tobacco plants corresponding to a rating of 3, 1 and 0 at days 7, 14 and 28, respectively, and
were analyzed vegetatively and reproductively (Table VII).



Table VII

Glyphosate tolerance in R0 tobacco CP4 transformants
Spray rate = 0.4 lb/acre (0.448 kg/hectare)

                              Score*
                              Vegetative                     Fertile
Vector/Plant #           day 7    day 14    day 28
pMON17110/25313            6        4         2              no
pMON17110/25329            9       10        10              yes
pMON17119/25106            9        9        10              yes

* Plants are evaluated on a numerical scoring system of 0-10 where a vegetative score of 10 represents no damage
relative to non-sprayed controls and 0 represents a dead plant. Reproductive scores (Fertile) are determined at 28 days
after spraying and are evaluated as to whether or not the plant is fertile.
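The metric equivalent quoted for the spray rate can be reproduced from the standard definitions of the pound and the acre. The sketch below is only a unit-conversion check; the function name is hypothetical.

```python
# Unit-conversion check for the spray rate; the function name is hypothetical.

KG_PER_LB = 0.45359237            # exact definition of the avoirdupois pound
HECTARES_PER_ACRE = 0.40468564224  # exact definition of the international acre

def lb_per_acre_to_kg_per_ha(rate_lb_per_acre: float) -> float:
    """Convert an application rate from lb/acre to kg/hectare."""
    return rate_lb_per_acre * KG_PER_LB / HECTARES_PER_ACRE

rate_kg_ha = lb_per_acre_to_kg_per_ha(0.4)
print(round(rate_kg_ha, 3))
```

Rounded to three decimals, the result agrees with the 0.448 kg/hectare figure given alongside the 0.4 lb/acre rate.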



EXAMPLE 2 

Canola plants were transformed with the pMON17110, pMON17116, and pMON17131 vectors and a number of 
plant lines of the transformed canola were obtained which exhibit glyphosate tolerance. 

Plant Material 

Seedlings of Brassica napus cv Westar were established in 2 inch (~5 cm) pots containing Metro Mix 350. They
were grown in a growth chamber at 24°C, 16/8 hour photoperiod, light intensity of 400 µE m⁻² sec⁻¹ (HID lamps). They
were fertilized with Peters 20-10-20 General Purpose Special. After 2 1/2 weeks they were transplanted to 6 inch
(~15 cm) pots and grown in a growth chamber at 15/10°C day/night temperature, 16/8 hour photoperiod, light intensity
of 800 µE m⁻² sec⁻¹ (HID lamps). They were fertilized with Peters 15-30-15 Hi-Phos Special.

Transformation/Selection/Regeneration 

Four terminal internodes from plants just prior to bolting or in the process of bolting but before flowering were
removed and surface sterilized in 70% v/v ethanol for 1 minute, 2% w/v sodium hypochlorite for 20 minutes and rinsed
3 times with sterile deionized water. Stems with leaves attached could be refrigerated in moist plastic bags for up to
72 hours prior to sterilization. Six to seven stem segments were cut into 5 mm discs with a Redco Vegetable Slicer 200
maintaining orientation of basal end.

The Agrobacterium was grown overnight on a rotator at 24°C in 2 ml of Luria Broth containing 50 mg/l kanamycin,
24 mg/l chloramphenicol and 100 mg/l spectinomycin. A 1:10 dilution was made in MS (Murashige and Skoog) media
giving approximately 9 x 10⁸ cells per ml. This was confirmed with optical density readings at 660 mµ. The stem discs
(explants) were inoculated with 1.0 ml of Agrobacterium and the excess was aspirated from the explants.

The explants were placed basal side down in petri plates containing 1/10X standard MS salts, B5 vitamins, 3%
sucrose, 0.8% agar, pH 5.7, 1.0 mg/l 6-benzyladenine (BA). The plates were layered with 1.5 ml of media containing
MS salts, B5 vitamins, 3% sucrose, pH 5.7, 4.0 mg/l p-chlorophenoxyacetic acid, 0.005 mg/l kinetin and covered with
sterile filter paper.

Following a 2 to 3 day co-culture, the explants were transferred to deep dish petri plates containing MS salts, B5 
vitamins, 3% sucrose, 0.8% agar, pH 5.7, 1mg/l BA, 500mg/l carbenicillin, 50mg/l cefotaxime, 200 mg/l kanamycin or 
175mg/l gentamicin for selection. Seven explants were placed on each plate. After 3 weeks they were transferred to 
fresh media, 5 explants per plate. The explants were cultured in a growth room at 25°C, continuous light (Cool White).

Expression Assay 

After 3 weeks, shoots were excised from the explants. Leaf recallusing assays were initiated to confirm modification
of R0 shoots. Three tiny pieces of leaf tissue were placed on recallusing media containing MS salts, B5 vitamins, 3%
sucrose, 0.8% agar, pH 5.7, 5.0 mg/l BA, 0.5 mg/l naphthalene acetic acid (NAA), 500 mg/l carbenicillin, 50 mg/l
cefotaxime and 200 mg/l kanamycin or gentamicin or 0.5 mM glyphosate. The leaf assays were incubated in a growth room
under the same conditions as explant culture. After 3 weeks the leaf recallusing assays were scored for herbicide
tolerance (callus or green leaf tissue) or sensitivity (bleaching).

Transplantation

At the time of excision, the shoot stems were dipped in Rootone® and placed in 2 inch (~5 cm) pots containing
Metro-Mix 350 and placed in a closed humid environment. They were placed in a growth chamber at 24°C, 16/8 hour
photoperiod, 400 µE m^-2 sec^-1 (HID lamps) for a hardening-off period of approximately 3 weeks.

The seed harvested from R0 plants is R1 seed which gives rise to R1 plants. To evaluate the glyphosate tolerance
of an R0 plant, its progeny are evaluated. Because an R0 plant is assumed to be hemizygous at each insert location,
selfing results in maximum genotypic segregation in the R1. Because each insert acts as a dominant allele, in the
absence of linkage and assuming only one hemizygous insert is required for tolerance expression, one insert would
segregate 3:1, two inserts 15:1, three inserts 63:1, etc. Therefore, relatively few R1 plants need be grown to find at
least one resistant phenotype.
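
The segregation ratios quoted above follow from each unlinked hemizygous insert being absent from one quarter of the selfed progeny, so that n inserts give (4^n - 1):1 tolerant to sensitive plants. The following is a minimal sketch of that arithmetic only; the function names are hypothetical and are not part of the original disclosure.

```python
from fractions import Fraction


def sensitive_fraction(n_inserts: int) -> Fraction:
    """Fraction of selfed progeny that inherit none of the inserts.

    Assumes, as in the text, unlinked hemizygous inserts that each act
    as a dominant allele: each insert is absent from 1/4 of the progeny.
    """
    if n_inserts < 1:
        raise ValueError("at least one insert is required")
    return Fraction(1, 4) ** n_inserts


def segregation_ratio(n_inserts: int) -> tuple:
    """Return the (tolerant, sensitive) ratio, e.g. (3, 1) for one insert."""
    sensitive = sensitive_fraction(n_inserts)
    tolerant = 1 - sensitive
    return (int(tolerant / sensitive), 1)


if __name__ == "__main__":
    for n in (1, 2, 3):
        tolerant, sensitive = segregation_ratio(n)
        print(f"{n} insert(s): {tolerant}:{sensitive}")
```

Exact fractions are used so that the ratios come out as integers without floating-point rounding.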

Seed from an R0 plant is harvested, threshed, and dried before planting in a glyphosate spray test. Various
techniques have been used to grow the plants for R1 spray evaluations. Tests are conducted in both greenhouses and
growth chambers. Two planting systems are used: ~10 cm pots or plant trays containing 32 or 36 cells. Soil used for
planting is either Metro 350 plus three types of slow release fertilizer or plain Metro 350. Irrigation is either
overhead in greenhouses or sub-irrigation in growth chambers. Fertilizer is applied as required in irrigation water.
Temperature regimes appropriate for canola were maintained. A sixteen hour photoperiod was maintained. At the onset
of flowering, plants are transplanted to ~15 cm pots for seed production.

A spray "batch" consists of several sets of R1 progenies all sprayed on the same date. Some batches may also
include evaluations of other than R1 plants. Each batch also includes sprayed and unsprayed non-transgenic genotypes
representing the genotypes in the particular batch which were putatively transformed. Also included in a batch is one
or more non-segregating transformed genotypes previously identified as having some resistance.

Two to six plants from each individual R0 progeny are not sprayed and serve as controls to compare and measure
the glyphosate tolerance, as well as to assess any variability not induced by the glyphosate. When the other plants
reach the 2-4 leaf stage, usually 10 to 20 days after planting, glyphosate is applied at rates varying from 0.28 to
1.12 kg/ha, depending on objectives of the study. Low rate technology using low volumes has been adopted. A laboratory
track sprayer has been calibrated to deliver a rate equivalent to field conditions.
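
Because this description quotes spray rates both in kg/ha and in pounds or ounces per acre, a unit conversion can help when comparing experiments. The sketch below is illustrative only: the function name is hypothetical, and the constants are the standard definitions of the avoirdupois pound and the international acre, not values taken from the original text.

```python
# Standard unit definitions (not taken from the original description).
KG_PER_LB = 0.45359237        # kilograms in one avoirdupois pound
HA_PER_ACRE = 0.40468564224   # hectares in one international acre


def kg_per_ha_to_lb_per_acre(rate_kg_per_ha: float) -> float:
    """Convert an application rate from kg/ha to lb/acre."""
    # kg/ha * (ha/acre) gives kg/acre; dividing by kg/lb gives lb/acre.
    return rate_kg_per_ha * HA_PER_ACRE / KG_PER_LB


if __name__ == "__main__":
    # Rates named in the canola spray tests of this example.
    for rate in (0.28, 0.56, 0.84, 1.12):
        print(f"{rate:.2f} kg/ha = {kg_per_ha_to_lb_per_acre(rate):.3f} lb/acre")
```

Under these definitions the canola rates fall very close to 0.25, 0.5, 0.75 and 1.0 lb/acre, which is an observation about the arithmetic rather than a statement made in the original text.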

A scale of 0 to 10 is used to rate the sprayed plants for vegetative resistance. The scale is relative to the unsprayed
plants from the same R0 plant. A 0 is death, while a 10 represents no visible difference from the unsprayed plant. A
higher number between 0 and 10 represents progressively less damage as compared to the unsprayed plant. Plants
are scored at 7, 14, and 28 days after treatment (DAT), or until bolting, and a line is given the average score of the
sprayed plants within an R0 plant family.

Six integers are used to qualitatively describe the degree of reproductive damage from glyphosate: 

0: No floral bud development
2: Floral buds present, but aborted prior to opening
4: Flowers open, but no anthers, or anthers fail to extrude past petals
6: Sterile anthers
8: Partially sterile anthers
10: Fully fertile flowers

Plants are scored using this scale at or shortly after initiation of flowering, depending on the rate of floral structure 
development. 

Expression of EPSPS in Canola

After the 3 week period, the transformed canola plants were assayed for the presence of glyphosate tolerant
EPSPS activity (assayed in the presence of glyphosate at 0.5 mM). The results are shown in Table VIII.

Table VIII

Expression of CP4 EPSPS in transformed canola plants

Vector       Plant #    % resistant EPSPS activity of leaf extract
                        (at 0.5 mM glyphosate)
Control                   0
pMON17110     41         47
pMON17110     52         28
pMON17110     71         82
pMON17110    104         75
pMON17110    172         84
pMON17110    177         85
pMON17110    252         29*
pMON17110    350         49
pMON17116     40         25
pMON17116     99         87
pMON17116    175         94
pMON17116    178         43
pMON17116    182         18
pMON17116    252         69
pMON17116    298         44*
pMON17116    332         89
pMON17116    383         97
pMON17116    395         52

* assayed in the presence of 1.0 mM glyphosate



R1 transformants of canola were then grown in a growth chamber and sprayed with glyphosate at 0.56 kg/ha
(kilogram/hectare) and rated vegetatively. These results are shown in Tables IXA - IXC. It is to be noted that
expression of glyphosate resistant EPSPS in all tissues is preferred to observe optimal glyphosate tolerance phenotype
in these transgenic plants. In the tables below, only expression results obtained with leaf tissue are described.



Table IXA

Glyphosate tolerance in Class II EPSPS canola R1 transformants
(pMON17110 = P-E35S; pMON17116 = P-FMV35S; R1 plants; Spray rate = 0.56 kg/ha)

                     % resistant    Vegetative score**
Vector/Plant No.     EPSPS*         day 7     day 14
Control Westar         0              5          3
pMON17110/41          47              6          7
pMON17110/71          82              6          7
pMON17110/177         85              9         10
pMON17116/40          25              9          9
pMON17116/99          87              9         10
pMON17116/175         94              9         10
pMON17116/178         43              6          3
pMON17116/182         18              9         10
pMON17116/383         97              9         10

* % resistant EPSPS activity in the presence of 0.5 mM glyphosate

** A vegetative score of 10 indicates no damage, a score of 0 is given to a dead plant.

Table IXB

Glyphosate tolerance in Class II EPSPS canola transformants
(pMON17131 = P-FMV35S; R1 plants; Spray rate = 0.84 kg/ha)

                     Vegetative score**    Reproductive score
Vector/Plant No.     day 14                day 28
17131/78              10                    10
17131/102              9                    10
17131/115              9                    10
17131/116              9                    10
17131/157              9                    10
17131/169             10                    10
17131/255             10                    10
control Westar         1                     0

** A vegetative score of 10 indicates no damage, a score of 0 is given to a dead plant.



Table IXC

Glyphosate tolerance in Class I EPSPS canola transformants
(P-E35S; R2 plants; Spray rate = 0.28 kg/ha)

                     % resistant    Vegetative score**
Vector/Plant No.     EPSPS*         day 7     day 14
Control Westar         0              4          2
pMON899/715           96              5          6
pMON899/744           95              8          8
pMON899/794           86              6          4
pMON899/818           81              7          8
pMON899/885           57              7          6

* % resistant EPSPS activity in the presence of 0.5 mM glyphosate

** A vegetative score of 10 indicates no damage, a score of 0 is given to a dead plant.



The data obtained for the Class II EPSPS transformants may be compared to glyphosate tolerant Class I EPSPS
transformants in which the same promoter is used to express the EPSPS genes and in which the level of glyphosate
tolerant EPSPS activity was comparable for the two types of transformants. A comparison of the data of pMON17110
[in Table IXA] and pMON17131 [in Table IXB] with that for pMON899 [in Table IXC; the Class I gene in pMON899 is that
from A. thaliana (Klee et al., 1987) in which the glycine at position 101 was changed to an alanine] illustrates that
the tolerance conferred by the Class II EPSPS is at least as good as that conferred by the Class I EPSPS. An
improvement in vegetative tolerance of Class II EPSPS is apparent when one takes into account that the Class II plants
were sprayed at twice the rate and were tested as R1 plants.



EXAMPLE 3 



Soybean plants were transformed with the pMON13640 (Figure 15) vector and a number of plant lines of the
transformed soybean were obtained which exhibit glyphosate tolerance.

Soybean plants are transformed with pMON13640 by the method of microprojectile injection using particle gun
technology as described in Christou et al. (1988). The seed harvested from R0 plants is R1 seed which gives rise to
R1 plants. To evaluate the glyphosate tolerance of an R0 plant, its progeny are evaluated. Because an R0 plant is
assumed to be hemizygous at each insert location, selfing results in maximum genotypic segregation in the R1. Because
each insert acts as a dominant allele, in the absence of linkage and assuming only one hemizygous insert is required
for tolerance expression, one insert would segregate 3:1, two inserts 15:1, three inserts 63:1, etc. Therefore,
relatively few R1 plants need be grown to find at least one resistant phenotype.

Seed from an R0 soybean plant is harvested and dried before planting in a glyphosate spray test. Seeds are
planted into 4 inch (~5 cm) square pots containing Metro 350. Twenty seedlings from each R0 plant are considered
adequate for testing. Plants are maintained and grown in a greenhouse environment. A 12.5-14 hour photoperiod and
temperatures of 30°C day and 24°C night are maintained. Water soluble Peters Pete Lite fertilizer is applied as needed.

A spray "batch" consists of several sets of R1 progenies all sprayed on the same date. Some batches may also
include evaluations of other than R1 plants. Each batch also includes sprayed and unsprayed non-transgenic genotypes
representing the genotypes in the particular batch which were putatively transformed. Also included in a batch is one
or more non-segregating transformed genotypes previously identified as having some resistance.
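
The choice of twenty seedlings per R0 plant can be related to the segregation ratios given above. The sketch below is an illustration under the same assumptions as the text (unlinked hemizygous inserts, each dominant, selfed parent); the function names are hypothetical and no claim is made that this calculation was used to choose the sample size.

```python
def tolerant_fraction(n_inserts: int) -> float:
    """Expected fraction of selfed progeny carrying at least one insert."""
    return 1.0 - 0.25 ** n_inserts


def expected_tolerant(n_inserts: int, n_seedlings: int = 20) -> float:
    """Expected number of tolerant seedlings among those grown."""
    return n_seedlings * tolerant_fraction(n_inserts)


def prob_at_least_one_tolerant(n_inserts: int, n_seedlings: int = 20) -> float:
    """Probability that at least one seedling carries an insert."""
    return 1.0 - (0.25 ** n_inserts) ** n_seedlings


def prob_at_least_one_sensitive(n_inserts: int, n_seedlings: int = 20) -> float:
    """Probability that at least one sensitive segregant is observed."""
    return 1.0 - tolerant_fraction(n_inserts) ** n_seedlings


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(
            f"{n} insert(s): expected tolerant = {expected_tolerant(n):.1f}, "
            f"P(>=1 tolerant) = {prob_at_least_one_tolerant(n):.6f}, "
            f"P(>=1 sensitive) = {prob_at_least_one_sensitive(n):.4f}"
        )
```

With a single insert, about fifteen of twenty seedlings are expected to be tolerant, and a sensitive segregant is almost certain to appear as an internal comparison.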

One to two plants from each individual R0 progeny are not sprayed and serve as controls to compare and measure
the glyphosate tolerance, as well as to assess any variability not induced by the glyphosate. When the other plants
reach the first trifoliate leaf stage, usually 2-3 weeks after planting, glyphosate is applied at a rate equivalent to
128 oz./acre (8.895 kg/ha) of Roundup®. A laboratory track sprayer has been calibrated to deliver a rate equivalent to
those conditions.

A vegetative score of 0 to 10 is used. The score is relative to the unsprayed progenies from the same R0 plant. A
0 is death, while a 10 represents no visible difference from the unsprayed plant. A higher number between 0 and 10
represents progressively less damage as compared to the unsprayed plant. Plants are scored at 7, 14, and 28 days
after treatment (DAT). The data from the analysis of one set of transformed and control soybean plants are described
in Table X and show that the CP4 EPSPS gene imparts glyphosate tolerance in soybean also.

Table X

Glyphosate tolerance in Class II EPSPS soybean transformants
(P-E35S, P-FMV35S; R0 plants; Spray rate = 128 oz./acre)

                     Vegetative score
Vector/Plant No.     day 7     day 14    day 28
13640/40-11            5          6         7
13640/40-3             9         10        10
13640/40-7             4          7         7
control A5403          2          1         0
control A5403          1          1         0


EXAMPLE 4

The CP4 EPSPS gene may be used to select transformed plant material directly on media containing glyphosate.
The ability to select and to identify transformed plant material depends, in most cases, on the use of a dominant
selectable marker gene to enable the preferential and continued growth of the transformed tissues in the presence of
a normally inhibitory substance. Antibiotic resistance and herbicide tolerance genes have been used almost exclusively
as such dominant selectable marker genes in the presence of the corresponding antibiotic or herbicide. The
nptII/kanamycin selection scheme is probably the most frequently used. It has been demonstrated that CP4 EPSPS is also
a useful and perhaps superior selectable marker/selection scheme for producing and identifying transformed plants.

A plant transformation vector that may be used in this scheme is pMON17227 (Figure 16). This plasmid resembles
many of the other plasmids described infra and is essentially composed of the previously described bacterial replicon
system that enables this plasmid to replicate in E. coli and to be introduced into and to replicate in Agrobacterium,
the bacterial selectable marker gene (Spc/Str), and, located between the T-DNA right border and left border, the
CTP2-CP4 synthetic gene in the FMV35S promoter-E9 3' cassette. This plasmid also has single sites for a number of
restriction enzymes, located within the borders and outside of the expression cassette. This makes it possible to
easily add other genes and genetic elements to the vector for introduction into plants.

The protocol for direct selection of transformed plants on glyphosate is outlined for tobacco. Explants are prepared
for pre-culture as in the standard procedure as described in Example 1: surface sterilization of leaves from 1 month
old tobacco plants (15 minutes in 10% Clorox + surfactant; 3X dH2O washes); explants are cut in 0.5 x 0.5 cm squares,
removing leaf edges, mid-rib, tip, and petiole end for uniform tissue type; explants are placed in single layer, upside
down, on MS104 plates + 2 ml 4COOSK media to moisten surface; pre-culture 1-2 days. Explants are inoculated using
overnight culture of Agrobacterium containing the plant transformation plasmid that is adjusted to a titer of
1.2 x 10^9 bacteria/ml with 4COOSK media. Explants are placed into a centrifuge tube, the Agrobacterium suspension is
added and the mixture of bacteria and explants is "Vortexed" on maximum setting for 25 seconds to ensure even
penetration of bacteria. The bacteria are poured off and the explants are blotted between layers of dry sterile filter
paper to remove excess bacteria. The blotted explants are placed upside down on MS104 plates + 2 ml 4COOSK media +
filter disc. Co-culture is 2-3 days. The explants are transferred to MS104 + Carbenicillin 1000 mg/l + cefotaxime
100 mg/l for 3 days (delayed phase). The explants are then transferred to MS104 + glyphosate 0.05 mM + Carbenicillin
1000 mg/l + cefotaxime 100 mg/l for selection phase. At 4-6 weeks shoots are cut from callus and placed on MSO +
Carbenicillin 500 mg/l rooting media. Roots form in 3-5 days, at which time leaf pieces can be taken from rooted plates
to confirm glyphosate tolerance and that the material is transformed.

The presence of the CP4 EPSPS protein in these transformed tissues has been confirmed by immunoblot analysis
of leaf discs. The data from one experiment with pMON17227 is presented in the following: 139 shoots formed on
glyphosate from 400 explants inoculated with Agrobacterium ABI/pMON17227; 97 of these were positive on recallusing
on glyphosate. These data indicate a transformation rate of 24 per 100 explants, which makes this a highly efficient
and time saving transformation procedure for plants. Similar transformation frequencies have been obtained with
pMON17131 and direct selection of transformants on glyphosate with the CP4 EPSPS genes has also been shown in
other plant species, including Arabidopsis, potato, tomato, cotton, lettuce, and sugarbeet.
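
The transformation rate quoted above can be reproduced from the counts given for the pMON17227 experiment (400 explants, 139 shoots, 97 shoots positive on recallusing). The following sketch shows only that arithmetic; the variable and function names are hypothetical.

```python
def percent(part: int, whole: int) -> float:
    """Return part/whole expressed as a percentage."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return 100.0 * part / whole


# Counts reported for the pMON17227 experiment in this example.
EXPLANTS_INOCULATED = 400
SHOOTS_ON_GLYPHOSATE = 139
SHOOTS_POSITIVE_ON_RECALLUSING = 97

# Confirmed transformants per 100 explants (the "transformation rate").
transformation_rate = percent(SHOOTS_POSITIVE_ON_RECALLUSING, EXPLANTS_INOCULATED)

# Share of selected shoots that were confirmed on recallusing.
confirmation_rate = percent(SHOOTS_POSITIVE_ON_RECALLUSING, SHOOTS_ON_GLYPHOSATE)

if __name__ == "__main__":
    print(f"transformation rate: {transformation_rate:.2f} per 100 explants")
    print(f"shoots confirmed on recallusing: {confirmation_rate:.1f} %")
```

The rate is computed from confirmed shoots rather than from all shoots formed, which matches the figure of 24 per 100 explants stated in the text.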

The pMON17227 plasmid contains single restriction enzyme recognition cleavage sites (NotI, XhoI, and BstXI)
between the CP4 glyphosate selection region and the left border of the vector for the cloning of additional genes and
to facilitate the introduction of these genes into plants.

EXAMPLE 5 


The CP4 EPSPS gene has also been introduced into Black Mexican Sweet (BMS) corn cells with expression of 
the protein and glyphosate resistance detected in callus. 

The backbone for this plasmid was a derivative of the high copy plasmid pUC119 (Viera and Messing, 1987). The
1.3Kb FspI-DraI pUC119 fragment containing the origin of replication was fused to the 1.3Kb SmaI-HindIII filled
fragment from pKC7 (Rao and Rogers, 1979) which contains the neomycin phosphotransferase type II gene to confer
bacterial kanamycin resistance. This plasmid was used to construct a monocot expression cassette vector containing
the 0.6kb cauliflower mosaic virus (CaMV) 35S RNA promoter with a duplication of the -90 to -300 region (Kay et al.,
1987), an 0.8kb fragment containing an intron from a maize gene in the 5' untranslated leader region, followed by a
polylinker and the 3' termination sequences from the nopaline synthase (NOS) gene (Fraley et al., 1983). A 1.7Kb
fragment containing the 300bp chloroplast transit peptide from the Arabidopsis EPSP synthase fused in frame to the
1.4Kb coding sequence for the bacterial CP4 EPSP synthase was inserted into the monocot expression cassette in
the polylinker between the intron and the NOS termination sequence to form the plasmid pMON19653 (Figure 17).

pMON19653 DNA was introduced into Black Mexican Sweet (BMS) cells by co-bombardment with EC9 (a plasmid
containing a sulfonylurea-resistant form of the maize acetolactate synthase gene). 2.5mg of each plasmid was coated
onto tungsten particles and introduced into log-phase BMS cells using a PDS-1000 particle gun essentially as described
(Klein et al., 1989). Transformants are selected on MS medium containing 20ppb chlorsulfuron. After initial selection
on chlorsulfuron, the calli can be assayed directly by Western blot. Glyphosate tolerance can be assessed by
transferring the calli to medium containing 5mM glyphosate. As shown in Table XI, CP4 EPSPS confers glyphosate
tolerance to corn callus.

Table XI

Expression of CP4 in BMS Corn Callus - pMON19653

Line     CP4 expression (% extracted protein)
284      0.006
287      0.036
290      0.061
295      0.073
299      0.113
309      0.042
313      0.003



To measure CP4 EPSPS expression in corn callus, the following procedure was used: BMS callus (3 g wet weight)
was dried on filter paper (Whatman #1) under vacuum, reweighed, and extraction buffer (500 µl/g dry weight; 100 mM
Tris, 1 mM EDTA, 10% glycerol) was added. The tissue was homogenized with a Wheaton overhead stirrer for 30
seconds at 2.8 power setting. After centrifugation (3 minutes, Eppendorf microfuge), the supernatant was removed
and the protein was quantitated (BioRad Protein Assay). Samples (50 µg/well) were loaded on an SDS PAGE gel (Jule,
3-17%) along with CP4 EPSPS standard (10 ng), electrophoresed, and transferred to nitrocellulose similarly to a
previously described method (Padgette, 1987). The nitrocellulose blot was probed with goat anti-CP4 EPSPS IgG, and
developed with I-125 Protein G. The radioactive blot was visualized by autoradiography. Results were quantitated by
densitometry on an LKB UltraScan XL laser densitometer and are tabulated in Table XI.
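
One way the densitometry readings described above could be converted to "% extracted protein" is sketched below. This is an assumption-laden illustration, not the procedure stated in the text: it assumes a linear densitometer response and a single-point calibration against the 10 ng CP4 EPSPS standard, with 50 µg of total protein loaded per well. The function name and the example density readings are hypothetical.

```python
def cp4_percent_of_protein(
    sample_density: float,
    standard_density: float,
    standard_ng: float = 10.0,        # CP4 EPSPS standard loaded on the gel
    protein_loaded_ug: float = 50.0,  # total extracted protein per well
) -> float:
    """Estimate CP4 EPSPS as a percentage of total extracted protein.

    Assumes band density is proportional to the amount of CP4 EPSPS
    (single-point calibration), which is an assumption of this sketch.
    """
    if standard_density <= 0 or protein_loaded_ug <= 0:
        raise ValueError("standard density and protein load must be positive")
    cp4_ng = sample_density / standard_density * standard_ng
    protein_loaded_ng = protein_loaded_ug * 1000.0
    return 100.0 * cp4_ng / protein_loaded_ng


if __name__ == "__main__":
    # Hypothetical readings: a sample band at 30% of the standard's density.
    value = cp4_percent_of_protein(sample_density=0.3, standard_density=1.0)
    print(f"{value:.3f} % of extracted protein")
```

In practice a standard curve with several loadings would be more robust than a single standard; the single-point form is used here only because one standard amount is mentioned in the text.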

Table XII

Glyphosate resistance in BMS Corn Callus using pMON19653

Vector         Experiment    # chlorsulfuron-resistant lines    # cross-resistant to glyphosate
19653          253           120                                81/120 = 67.5%
19653          254            80                                37/80 = 46%
EC9 control    253/254         8                                0/8 = 0%


Improvements in the expression of Class I EPSPS could also be achieved by expressing the gene using stronger
plant promoters, using better 3' polyadenylation signal sequences, optimizing the sequences around the initiation codon
for ribosome loading and translation initiation, or by combination of these or other expression or regulatory sequences
or factors. It would also be beneficial to transform the desired plant with a Class I EPSPS gene in conjunction with
another glyphosate tolerant EPSPS gene or a gene capable of degrading glyphosate in order to enhance the glyphosate
tolerance of the transformed plant.

From the foregoing, it will be seen that this invention is one well adapted to attain all the ends and objects
hereinabove set forth together with advantages which are obvious and which are inherent to the invention.

It will be understood that certain features and subcombinations are of utility and may be employed without reference
to other features and subcombinations. This is contemplated by and is within the scope of the claims.

Since many possible embodiments may be made of the invention without departing from the scope thereof, it is 
to be understood that all matter herein set forth or shown in the accompanying drawings is to be interpreted as illustrative 
and not in a limiting sense. 

EXAMPLE 6

The LBAA Class II EPSPS gene has been introduced into plants and also imparts glyphosate tolerance. Data on 
tobacco transformed with pMON17206 (infra) are presented in Table XIII. 



Table XIII

Tobacco Glyphosate Spray Test (pMON17206; E35S-CTP2-LBAA EPSPS; 0.4 lbs/ac)

Line                7 Day Rating
33358                9
34586                9
33328                9
34606                9
33377                9
34611               10
34607               10
34601                9
34589                9
Samsun (Control)     4

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

TCATCAAAAT ATTTACCAGC ATTCCAGATT GGGTTCAATC AACAACCTAC CAGCCATATC
ACTTTATTCA AATTGGTATC CCCAAAACCA AGAACGAACT CCCATCCTCA AAGGTTTGTA
ACGAAGAATT CTCAGTCCAA AGCCTCAACA AGCTCAGGGT ACAGAGTCTC CAAACCATTA
CCCAAAAGCT ACAGGAGATC AATGAAGAAT CTTCAATCAA AGTAAACTAC TGTTCCAGCA
CATGCATCAT GGTCAGTAAG TTTCAGAAAA AGACATCCAC CGAAGACTTA AAGTTAGTGG
CCATCTTTGA AAGTAATCTT GTCAACATOC AGCAGCTGGC TTGTGGGGAC CAGACAAAAA
AGGAATGGTG CACAATTGTT AGGCGCACCT ACCAAAAGCA TCTTTCCCTT TATTCCAAAG
ATAAAGCAGA TTCCTCTAGT ACAAGTGGGG AACAAAATAA CGTGGAAAAG AGCTGTCCTG

(A) LENGTH: 1982 base pairs

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 62..1426

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

AAGCCCGCCT TCTCTCCCGC CCTCCCCCCG CACACCCCTC CATAGATTAA CCAAGACGCC 60

C ATG TCG CAC CCT GCA AGC AGC CCC CCC CCA ACC GCC CGC AAA TCC 106
Met Ser His Gly Ala Ser Ser Arg Pro Ala Thr Ala Arg Lys Ser
1 5 10 15

TCT GGC CTT TCC GCA ACC GTC CGC ATT CCC GCC GAC AAG TCG ATC TCC 154
Ser Gly Leu Ser Gly Thr Val Arg Ile Pro Gly Asp Lys Ser Ile Ser
20 25 30

CAC CCG TCC TTC ATG TTC GGC GGT CTC CCC AGC GGT GAA ACG CGC ATC 202
His Arg Ser Phe Met Phe Gly Gly Leu Ala Ser Gly Glu Thr Arg Ile
35 40 45

ACC GGC CTT CTC GAA GCC GAG GAC GTC ATC AAT ACG GCC AAG CCC ATG 250
Thr Gly Leu Leu Glu Gly Glu Asp Val Ile Asn Thr Gly Lys Ala Met
50 55 60

CAG GCC ATG GGC GCC AGG ATC CGT AAG GAA GGC GAC ACC TGG ATC ATC 298
Gln Ala Met Gly Ala Arg Ile Arg Lys Glu Gly Asp Thr Trp Ile Ile
65 70 75

CAT GGC GTC GGC AAT GGC GCC CTC CTG GCG CCT GAG GCG CCG CTC GAT 346
Asp Gly Val Gly Asn Gly Gly Leu Leu Ala Pro Glu Ala Pro Leu Asp
80 85 90 95

TTC GGC AAT GCC GCC ACG GGC TGC CGC CTG ACC ATG GGC CTC GTC GGG 394
Phe Gly Asn Ala Ala Thr Gly Cys Arg Leu Thr Met Gly Leu Val Gly
100 105 110

GTC TAC CAT TTC GAC AGC ACC TTC ATC GGC GAC GCC TCG CTC ACA AAG 442
Val Tyr Asp Phe Asp Ser Thr Phe Ile Gly Asp Ala Ser Leu Thr Lys
115 120 125

CGC CCG ATC CGC CCC CTG TTC AAC CCG CTG CGC GAA ATC GGC CTC CAG 490
Arg Pro Met Gly Arg Val Leu Asn Pro Leu Arg Glu Met Gly Val Gln
130 135 140

CTG AAA TCG GAA GAC GCT GAC CGT CTT CCC GTT ACC TTC CGC GCG CCC 538
Val Lys Ser Glu Asp Gly Asp Arg Leu Pro Val Thr Leu Arg Gly Pro
145 150 155

AAG ACG CCG ACG CCG ATC ACC TAC CCC GTC CCG ATC CCC TCC CCA CAG 586
Lys Thr Pro Thr Pro Ile Thr Tyr Arg Val Pro Met Ala Ser Ala Gln
160 165 170 175

CTG AAG TCC GCC GTG CTG CTC GCC GGC CTC AAC ACG CCC GGC ATC ACG 634
Val Lys Ser Ala Val Leu Leu Ala Gly Leu Asn Thr Pro Gly Ile Thr
180 185 190

ACG GTC ATC GAG CCG ATC ATG ACG CGC GAT CAT ACG GAA AAG ATG CTG 682
Thr Val Ile Glu Pro Ile Met Thr Arg Asp His Thr Glu Lys Met Leu
195 200 205

CAG GGC TTT GGC GCC AAC CTT ACC GTC GAG ACG GAT GCG GAC CGC GTG 730
Gln Gly Phe Gly Ala Asn Leu Thr Val Glu Thr Asp Ala Asp Gly Val
210 215 220

CCC ACC ATC CGC CTG GAA CGC CGC CCC AAC CTC ACC GGC CAA CTC ATC 778
Arg Thr Ile Arg Leu Glu Gly Arg Gly Lys Leu Thr Gly Gln Val Ile
225 230 235

GAC GTG CCG GGC GAC CCG TCC TCG ACC GCC TTC CCG CTC CTT CCC CCC 826
Asp Val Pro Gly Asp Pro Ser Ser Thr Ala Phe Pro Leu Val Ala Ala
240 245 250 255

CTC CTT GTT CCG CGC TCC GAC GTC ACC ATC CTC AAC GTG CTG ATC AAC 874
Leu Leu Val Pro Gly Ser Asp Val Thr Ile Leu Asn Val Leu Met Asn
260 265 270

CCC ACC CGC ACC CCC CTC ATC CTC ACC CTC CAC GAA ATC GCC GCC CAC 922
Pro Thr Arg Thr Gly Leu Ile Leu Thr Leu Gln Glu Met Gly Ala Asp
275 280 285

ATC GAA GTC ATC AAC CCG CGC CTT CCC GCC CGC CAA GAC CTG CCC CAC 970
Ile Glu Val Ile Asn Pro Arg Leu Ala Gly Gly Glu Asp Val Ala Asp
290 295 300

CTG CGC GTT CGC TCC TCC ACG CTG AAG GGC GTC ACG CTG CCG GAA GAC 1018
Leu Arg Val Arg Ser Ser Thr Leu Lys Gly Val Thr Val Pro Glu Asp
305 310 315

CCC GCC CCT TCG ATG ATC CAC CAA TAT CCG ATT CTC GCT GTC GCC GCC 1066
Arg Ala Pro Ser Met Ile Asp Glu Tyr Pro Ile Leu Ala Val Ala Ala
320 325 330 335

GCC TTC CCG CAA GGG GCG ACC CTG ATG AAC CGT CTG CAA CAA CTC CGC 1114
Ala Phe Ala Glu Gly Ala Thr Val Met Asn Gly Leu Glu Glu Leu Arg
340 345 350

GTC AAG CAA AGC GAC CGC CTC TCG GCC GTC GCC AAT CGC CTC AAC CTC 1162
Val Lys Glu Ser Asp Arg Leu Ser Ala Val Ala Asn Gly Leu Lys Leu
355 360 365

AAT GGC CTG GAT TCC GAT GAG GGC GAG ACG TCC CTC CTC GTG CGC CCC 1210
Asn Gly Val Asp Cys Asp Glu Gly Glu Thr Ser Leu Val Val Arg Gly
370 375 380

CGC CCT GAC GGC AAG GGG CTC GGC AAC GCC TCG GGC GCC GCC GTC GCC 1258
Arg Pro Asp Gly Lys Gly Leu Gly Asn Ala Ser Gly Ala Ala Val Ala
385 390 395

ACC CAT CTC GAT CAC CCC ATC GCC ATG AGC TTC CTC GTC ATG CGC CTC 1306
Thr His Leu Asp His Arg Ile Ala Met Ser Phe Leu Val Met Gly Leu
400 405 410 415

GTG TCG CAA AAC CCT GTC ACG GTG GAC GAT GCC ACG ATG ATC GCC ACG 1354
Val Ser Glu Asn Pro Val Thr Val Asp Asp Ala Thr Met Ile Ala Thr
420 425 430

AGC TTC CCG GAG TTC ATG GAC CTG ATC GCC GGG CTG GGC CCG AAC ATC 1402
Ser Phe Pro Glu Phe Met Asp Leu Met Ala Gly Leu Gly Ala Lys Ile
435 440 445

CAA CTC TCC GAT ACC AAG GCT CCC TGATGACCTT CACAATCGCC ATCGATGCTC 1456
Glu Leu Ser Asp Thr Lys Ala Ala
450 455

CCCCTGCGGC CGGCAAGGGG ACGCTCTCGC GCCCTATCGC GGAGCTCTAT GGCTTTCATC 1516

ATCTCGATAC CGGCCTGACC TATCGCGCCA CGGCCAAAGC GCTGCTCGAT CGCGGCCTCT 1576

CGCTTGATCA CGAGGCGGTT GCGCCCGATC TCGCCCCCAA TCTCCATCTT GCCGGGCTCG 1636

ACCGGTCGGT GCTGTCGCCC CATGCCATCG GCGAGGCGGC TTCGAAGATC CCGGTCATGC 1696

CCTCGGTCCG CCGGGCGCTG GTCGAGGCGC AGCGCACCTT TGCGGCGCCT CAGCCGGGCA 1756

CGGTCCTGCA TCGACGCCAT ATCGGCACGG TGGTCTGCCC GGATGCGCCG CTGAAGCTCT 1816

ATGTCACCGC GTCACCGGAA GTGCGCGCGA AACGCCGCTA TGACGAAATC CTCGGCAATG 1876

CCGGCTTGCC CGATTACGGG ACGATCCTCG AGGATATCCG CCGCCGCGAC GAGCGGGACA 1936

TGGGTCGGGC GGACAGTCCT TTGAAGCCCG CCGACGATGC GCACTT 1982

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

Met Ser His Gly Ala Ser Ser Arg Pro Ala Thr Ala Arg Lys Ser Ser
1 5 10 15

Gly Leu Ser Gly Thr Val Arg Ile Pro Gly Asp Lys Ser Ile Ser His
20 25 30

Arg Ser Phe Met Phe Gly Gly Leu Ala Ser Gly Glu Thr Arg Ile Thr
35 40 45

Gly Leu Leu Glu Gly Glu Asp Val Ile Asn Thr Gly Lys Ala Met Gln
50 55 60

Ala Met Gly Ala Arg Ile Arg Lys Glu Gly Asp Thr Trp Ile Ile Asp
65 70 75 80

Gly Val Gly Asn Gly Gly Leu Leu Ala Pro Glu Ala Pro Leu Asp Phe
85 90 95

Gly Asn Ala Ala Thr Gly Cys Arg Leu Thr Met Gly Leu Val Gly Val
100 105 110

Tyr Asp Phe Asp Ser Thr Phe Ile Gly Asp Ala Ser Leu Thr Lys Arg
115 120 125

Pro Met Gly Arg Val Leu Asn Pro Leu Arg Glu Met Gly Val Gln Val
130 135 140

Lys Ser Glu Asp Gly Asp Arg Leu Pro Val Thr Leu Arg Gly Pro Lys
145 150 155 160

Thr Pro Thr Pro Ile Thr Tyr Arg Val Pro Met Ala Ser Ala Gln Val
165 170 175

Lys Ser Ala Val Leu Leu Ala Gly Leu Asn Thr Pro Gly Ile Thr Thr
180 185 190

Val Ile Glu Pro Ile Met Thr Arg Asp His Thr Glu Lys Met Leu Gln
195 200 205

Gly Phe Gly Ala Asn Leu Thr Val Glu Thr Asp Ala Asp Gly Val Arg
210 215 220

Thr Ile Arg Leu Glu Gly Arg Gly Lys Leu Thr Gly Gln Val Ile Asp
225 230 235 240

Val Pro Gly Asp Pro Ser Ser Thr Ala Phe Pro Leu Val Ala Ala Leu
245 250 255

Leu Val Pro Gly Ser Asp Val Thr Ile Leu Asn Val Leu Met Asn Pro
260 265 270

Thr Arg Thr Gly Leu Ile Leu Thr Leu Gln Glu Met Gly Ala Asp Ile
275 280 285

Glu Val Ile Asn Pro Arg Leu Ala Gly Gly Glu Asp Val Ala Asp Leu
290 295 300

Arg Val Arg Ser Ser Thr Leu Lys Gly Val Thr Val Pro Glu Asp Arg
305 310 315 320

Ala Pro Ser Met Ile Asp Glu Tyr Pro Ile Leu Ala Val Ala Ala Ala
325 330 335

Phe Ala Glu Gly Ala Thr Val Met Asn Gly Leu Glu Glu Leu Arg Val
340 345 350

Lys Glu Ser Asp Arg Leu Ser Ala Val Ala Asn Gly Leu Lys Leu Asn
355 360 365

Gly Val Asp Cys Asp Glu Gly Glu Thr Ser Leu Val Val Arg Gly Arg
370 375 380

Pro Asp Gly Lys Gly Leu Gly Asn Ala Ser Gly Ala Ala Val Ala Thr
385 390 395 400

His Leu Asp His Arg Ile Ala Met Ser Phe Leu Val Met Gly Leu Val
405 410 415

Ser Glu Asn Pro Val Thr Val Asp Asp Ala Thr Met Ile Ala Thr Ser
420 425 430

Phe Pro Glu Phe Met Asp Leu Met Ala Gly Leu Gly Ala Lys Ile Glu
435 440 445

Leu Ser Asp Thr Lys Ala Ala
450 455



55 (2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 
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20 



(A) LENGTH: 1673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

s 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

10 (A) NAME/KEY: CDS 

(B) LOCATION: 86.. 1432 

(xt) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

15 GTAGCCACAC ATAATTACTA TAGCTAGCAA GCCCGCTATC TCTCAATCCC GCGTGATCGC 60 

GCCAAAATGT GACTCTGAAA AATCC ATG TCC CAT TCT CCA TCC CCC AAA CCA 112 

Met Ser Hie Ser Ala Ser Pro Lys Pro 
1 5 

CCA ACC CCC CCC CCC TCG GAG CCA CTC ACG CGC GAA ATC CGC ATT CCC 160 
Ala Thr Ala Arg Arg Ser Glu Ala Leu Thr Cly Clu He Arg He Pro 
10 15 20 25 

25 CGC GAC AAC TCC ATC TCO CAT CGC TCC TTC ATC TTT CCC GCT CTC CCA 206 

Cly Asp Lys Ser He Ser His Arg Ser Phe Met Phe Gly Gly Leu Ala 
30 35 40 

TCG GCC GAA ACC CGC ATC ACC GGC CTT CTG GAA CCC GAG GAC CTC ATC 256 
30 Ser Gly Glu Thr Arg He Thr Cly Leu Leu Clu Cly Clu Aap Val He 

45 50 55 

AAT ACA GGC CGC GCC ATG CAG GCC ATG GCC GCG AAA ATC CCT AAA GAC 304 
Aen Thr Gly Arg Ala Met Gin Ala Met Gly Ala Lys He Arg Lys Glu 

35 60 65 70 

CGC CAT CTC TCG ATC ATC AAC GCC GTC GCC AAT CCC TGC CTG TTG CAG 352 
Cly Asp Val Trp He He Asn Cly Val Gly Asn Gly Cys Leu Leu Cln 
75 80 85 

CCC CAA CCT GCG CTC CAT TTC CCC AAT CCC CCA ACC GCC GCC CGC CTC 400 
Pro Glu Ala Ala Leu Asp Phe Gly Asn Ala Gly Thr Gly Ala Arg Leu
90 95 100 105 

ACC ATC GGC CTT GTC CCC ACC TAT GAC ATC AAC ACC TCC TTT ATC CCC 448
Thr Met Gly Leu Val Gly Thr Tyr Asp Met Lys Thr Ser Phe Ile Gly
110 115 120 

GAC GCC TCG CTG TCC AAG CGC CCG ATC GGC CCC GTC CTG AAC CCG TTC 496 
Asp Ala Ser Leu Ser Lys Arg Pro Met Gly Arg Val Leu Asn Pro Leu
125 130 135

CCC CAA ATC CCC CTT CAG CTG GAA CCA CCC CAT CCC CAC CCC ATG CCC 544 

Arg Glu Met Gly Val Gln Val Glu Ala Ala Asp Gly Asp Arg Met Pro

140 145 150 

CTC ACC CTG ATC GCC CCC AAG ACG CCC AAT CCG ATC ACC TAT CCC CTC 592

Leu Thr Leu Ile Gly Pro Lys Thr Ala Asn Pro Ile Thr Tyr Arg Val
155 160 165

CCG ATC GCC TCC GCG CAG GTA AAA TCC GCC CTG CTG CTC CCC GGT CTC 640

Pro Met Ala Ser Ala Gln Val Lys Ser Ala Val Leu Leu Ala Gly Leu
170 175 180 185

AAC ACG CCG GGC GTC ACC ACC CTC ATC CAG CCG GTC ATG ACC CCC CAC 688

Asn Thr Pro Gly Val Thr Thr Val Ile Glu Pro Val Met Thr Arg Asp
190 195 200 

CAC ACC GAA AAG ATG CTG CAG GGC TTT GGC GCC GAC CTC ACG GTC GAG 736 
His Thr Glu Lys Met Leu Gln Gly Phe Gly Ala Asp Leu Thr Val Glu

205 210 215 

ACC GAC AAG GAT GGC GTG CGC CAT ATC CCC ATC ACC GGC CAG GGC AAG 784 
Thr Asp Lys Asp Gly Val Arg His Ile Arg Ile Thr Gly Gln Gly Lys
220 225 230

CTT GTC GGC CAG ACC ATC GAC GTG CCG GGC GAT CCG TCA TCG ACC GCC 832 
Leu Val Gly Gln Thr Ile Asp Val Pro Gly Asp Pro Ser Ser Thr Ala
235 240 245 

TTC CCG CTC CTT GCC GCC CTT CTG CTG GAA GGT TCC GAC GTC ACC ATC 880

Phe Pro Leu Val Ala Ala Leu Leu Val Glu Gly Ser Asp Val Thr Ile
250 255 260 265 

CGC AAC GTG CTG ATG AAC CCG ACC CCT ACC GCC CTC ATC CTC ACC TTC 928 
Arg Asn Val Leu Met Asn Pro Thr Arg Thr Gly Leu Ile Leu Thr Leu

270 275 280 

CAG GAA ATG GCC GCC GAT ATC GAA GTG CTC AAT CCC CGT CTT CCA GGC 976 
Gln Glu Met Gly Ala Asp Ile Glu Val Leu Asn Ala Arg Leu Ala Gly
285 290 295

GGC GAA GAC GTC GCC GAT CTG CGC GTC AGG GCT TCC AAG CTC AAG GGC 1024 
Gly Glu Asp Val Ala Asp Leu Arg Val Arg Ala Ser Lys Leu Lys Gly 
300 305 310 

GTC GTC GTT CCG CCG GAA CGT GCG CCG TCG ATG ATC GAC GAA TAT CCG 1072
Val Val Val Pro Pro Glu Arg Ala Pro Ser Met Ile Asp Glu Tyr Pro
315 320 325 

GTC CTG GCG ATT CCC GCC TCC TTC GCG GAA GGC GAA ACC GTG ATG GAC 1120

Val Leu Ala Ile Ala Ala Ser Phe Ala Glu Gly Glu Thr Val Met Asp
330 335 340 345 

CCG CTC GAC GAA CTG CGC GTC AAC GAA TCG CAT CGT CTG CCA GCG GTC 1168 
Gly Leu Asp Glu Leu Arg Val Lys Glu Ser Asp Arg Leu Ala Ala Val 
350 355 360

GCA CGC GGC CTT GAA GCC AAC GGC GTC GAT TGC ACC GAA GGC GAC ATG 1216 
Ala Arg Gly Leu Glu Ala Asn Gly Val Asp Cys Thr Glu Gly Glu Met
365 370 375 




TOO CTC ACG GTT CGC GCC CCC CCC GAC GCC AAG GGA CTC CGC CCC GCC 1264 
Ser Leu Thr Val Arg Gly Arg Pro Asp Gly Lys Gly Leu Gly Gly Gly
380 385 390 

ACC GTT GCA ACC CAT CTC GAT CAT CCT ATC GCG ATC ACC TTC CTC CTG 1312 
Thr Val Ala Thr His Leu Asp His Arg Ile Ala Met Ser Phe Leu Val
395 400 405

ATG GCC CTT GCG GCG GAA AAG CCG GTG ACG GTT GAC GAC ACT AAC ATC 1360 
Met Gly Leu Ala Ala Glu Lys Pro Val Thr Val Asp Asp Ser Asn Met
410 415 420 425

ATC GCC ACG TCC TTC CCC GAA TTC ATG GAC ATG ATG CCG GGA TTC CCC 1408 
Ile Ala Thr Ser Phe Pro Glu Phe Met Asp Met Met Pro Gly Leu Gly
430 435 440 

GCA AAG ATC GAG TTG AGC ATA CTC TAGTCACTCG ACACCGAAAA TATTATTTGC 1462 
Ala Lys Ile Glu Leu Ser Ile Leu
445 



GACATTGGCC ATTATTACCG GTTGGTCTCA GCGGGGGTTT AATGTCCAAT CTTCCATACC 1522 

TAACAGCATC AGCAAATATC AAAAAACCTT TACAAGCAAT TGCTACACCA GCCACGCCGC 1582 

CTAAGCTTTC TCAACACTTC GTTAAAACTG TACTGAAATC CCGGGCGGTC CGGGGATCAA 1642

ATCACTTCAT TTCTCAGAAA TTCGCCTCGC A 1673
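
The CDS LOCATION entries in this listing can be cross-checked against the stated lengths of the corresponding protein sequences. The following is a minimal sketch of such a check, not part of the original listing; the function names `cds_codon_count` and `check_listing` and the layout of the `LISTING` table are illustrative assumptions, while the coordinates and protein lengths are those stated in the feature tables for SEQ ID NOS:4 to 7 and 10 to 17.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of codons in a CDS given 1-based, inclusive coordinates."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"CDS {start}..{end} is not a whole number of codons")
    return length // 3


# (CDS start, CDS end, stated protein length), as given in the listing.
LISTING = {
    "SEQ ID NO:4 / NO:5": (86, 1432, 449),
    "SEQ ID NO:6 / NO:7": (34, 1380, 449),
    "SEQ ID NO:10 / NO:11": (87, 317, 77),
    "SEQ ID NO:12 / NO:13": (87, 401, 105),
    "SEQ ID NO:14 / NO:15": (14, 232, 73),
    "SEQ ID NO:16 / NO:17": (49, 351, 101),
}


def check_listing(listing):
    """Map each entry to True when the CDS codon count equals the protein length."""
    return {
        name: cds_codon_count(start, end) == protein_length
        for name, (start, end, protein_length) in listing.items()
    }


if __name__ == "__main__":
    for name, consistent in check_listing(LISTING).items():
        print(name, "consistent" if consistent else "MISMATCH")
```

Under these assumptions every pair is consistent; for example, positions 86..1432 span 1347 bases, that is 449 codons.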



(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 449 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

Met Ser His Ser Ala Ser Pro Lys Pro Ala Thr Ala Arg Arg Ser Glu
1 5 10 15

Ala Leu Thr Gly Glu Ile Arg Ile Pro Gly Asp Lys Ser Ile Ser His
20 25 30

Arg Ser Phe Met Phe Gly Gly Leu Ala Ser Gly Glu Thr Arg Ile Thr
35 40 45

Gly Leu Leu Glu Gly Glu Asp Val Ile Asn Thr Gly Arg Ala Met Gln
50 55 60

Ala Met Gly Ala Lys Ile Arg Lys Glu Gly Asp Val Trp Ile Ile Asn
65 70 75 80

Gly Val Gly Asn Gly Cys Leu Leu Gln Pro Glu Ala Ala Leu Asp Phe
85 90 95 
Gly Asn Ala Gly Thr Gly Ala Arg Leu Thr Met Gly Leu Val Gly Thr
100 105 110

Tyr Asp Met Lys Thr Ser Phe Ile Gly Asp Ala Ser Leu Ser Lys Arg
115 120 125

Pro Met Gly Arg Val Leu Asn Pro Leu Arg Glu Met Gly Val Gln Val
130 135 140

Glu Ala Ala Asp Gly Asp Arg Met Pro Leu Thr Leu Ile Gly Pro Lys
145 150 155 160

Thr Ala Asn Pro Ile Thr Tyr Arg Val Pro Met Ala Ser Ala Gln Val
165 170 175

Lys Ser Ala Val Leu Leu Ala Gly Leu Asn Thr Pro Gly Val Thr Thr
180 185 190

Val Ile Glu Pro Val Met Thr Arg Asp His Thr Glu Lys Met Leu Gln
195 200 205

Gly Phe Gly Ala Asp Leu Thr Val Glu Thr Asp Lys Asp Gly Val Arg
210 215 220

His Ile Arg Ile Thr Gly Gln Gly Lys Leu Val Gly Gln Thr Ile Asp
225 230 235 240

Val Pro Gly Asp Pro Ser Ser Thr Ala Phe Pro Leu Val Ala Ala Leu
245 250 255

Leu Val Glu Gly Ser Asp Val Thr Ile Arg Asn Val Leu Met Asn Pro
260 265 270

Thr Arg Thr Gly Leu Ile Leu Thr Leu Gln Glu Met Gly Ala Asp Ile
275 280 285

Glu Val Leu Asn Ala Arg Leu Ala Gly Gly Glu Asp Val Ala Asp Leu
290 295 300

Arg Val Arg Ala Ser Lys Leu Lys Gly Val Val Val Pro Pro Glu Arg
305 310 315 320

Ala Pro Ser Met Ile Asp Glu Tyr Pro Val Leu Ala Ile Ala Ala Ser
325 330 335

Phe Ala Glu Gly Glu Thr Val Met Asp Gly Leu Asp Glu Leu Arg Val
340 345 350

Lys Glu Ser Asp Arg Leu Ala Ala Val Ala Arg Gly Leu Glu Ala Asn
355 360 365

Gly Val Asp Cys Thr Glu Gly Glu Met Ser Leu Thr Val Arg Gly Arg
370 375 380

Pro Asp Gly Lys Gly Leu Gly Gly Gly Thr Val Ala Thr His Leu Asp
385 390 395 400
His Arg Ile Ala Met Ser Phe Leu Val Met Gly Leu Ala Ala Glu Lys
405 410 415

Pro Val Thr Val Asp Asp Ser Asn Met Ile Ala Thr Ser Phe Pro Glu
420 425 430

Phe Met Asp Met Met Pro Gly Leu Gly Ala Lys Ile Glu Leu Ser Ile
435 440 445 

Leu 



(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 34..1380

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
GTGATCGCGC CAAAATGTGA CTGTGAAAAA TCC ATG TCC CAT TCT GCA TCC CCG 54

Met Ser His Ser Ala Ser Pro 
1 5 

AAA CCA GCA ACC GCC CGC CGC TCC GAG GCA CTC ACG GGC GAA ATC CGC 102 
Lys Pro Ala Thr Ala Arg Arg Ser Glu Ala Leu Thr Gly Glu Ile Arg
10 15 20 

ATT CCG GGC GAC AAC TCC ATC TCG CAT CGC TCC TTC ATG TTT CGC GCT 150 
Ile Pro Gly Asp Lys Ser Ile Ser His Arg Ser Phe Met Phe Gly Gly
25 30 35 

CTC GCA TCG GGC GAA ACC CGC ATC ACC GGC CTT CTG GAA CGC GAG GAC 198

Leu Ala Ser Gly Glu Thr Arg Ile Thr Gly Leu Leu Glu Gly Glu Asp
40 45 50 55 

GTC ATC AAT ACA GGC CGC GCC ATG CAG GCC ATG GGC GCG AAA ATC CGT 246 
Val Ile Asn Thr Gly Arg Ala Met Gln Ala Met Gly Ala Lys Ile Arg

60 65 70 

AAA GAG GGC GAT GTC TGG ATC ATC AAC GGC GTC GGC AAT GGC TGC CTG 294 
Lys Glu Gly Asp Val Trp Ile Ile Asn Gly Val Gly Asn Gly Cys Leu
75 80 85

TTC CAG CCC GAA GCT GCG CTC GAT TTC GGC AAT GCC GGA ACC GGC GCG 342 
Leu Gln Pro Glu Ala Ala Leu Asp Phe Gly Asn Ala Gly Thr Gly Ala
90 95 100

CGC CTC ACC ATG GGC CTT CTC CCC ACC TAT CAC ATG AAG ACC TCC TTT 390
Arg Leu Thr Met Gly Leu Val Gly Thr Tyr Asp Met Lys Thr Ser Phe
105 110 115 

ATC CCC CAC CCC TCG CTC TCC AAG CCC CCC ATG CCC CCC CTC CTC AAC 438 
Ile Gly Asp Ala Ser Leu Ser Lys Arg Pro Met Gly Arg Val Leu Asn
120 125 130 135 



CCG TTG CCC GAA ATG GGC GTT CAC GTG GAA CCA GCC CAT CCC GAC CCC 486 
Pro Leu Arg Glu Met Gly Val Gln Val Glu Ala Ala Asp Gly Asp Arg
140 145 150 

ATG CCG CTG ACG CTG ATC GGC CCG AAG ACG GCC AAT CCG ATC ACC TAT 534 
Met Pro Leu Thr Leu Ile Gly Pro Lys Thr Ala Asn Pro Ile Thr Tyr
155 160 165 

CCC GTG CCG ATG GCC TCC CCG CAC GTA AAA TCC GCC GTG CTG CTC GCC 582 
Arg Val Pro Met Ala Ser Ala Gln Val Lys Ser Ala Val Leu Leu Ala
170 175 180 

CCT CTC AAC ACC CCG GGC GTC ACC ACC GTC ATC GAG CCG CTC ATG ACC 630 
Gly Leu Asn Thr Pro Gly Val Thr Thr Val Ile Glu Pro Val Met Thr
185 190 195 

CCC GAC CAC ACC GAA AAG ATG CTG CAG GGC TTT GGC GCC GAC CTC ACG 678 
Arg Asp His Thr Glu Lys Met Leu Gln Gly Phe Gly Ala Asp Leu Thr
200 205 210 215 

CTC GAG ACC GAC AAG GAT GGC GTG CGC CAT ATC CCC ATC ACC GGC CAG 726

Val Glu Thr Asp Lys Asp Gly Val Arg His Ile Arg Ile Thr Gly Gln
220 225 230 

GGC AAG CTT GTC GGC CAG ACC ATC GAC GTG CCG GGC GAT CCG TCA TCG 774 
Gly Lys Leu Val Gly Gln Thr Ile Asp Val Pro Gly Asp Pro Ser Ser

235 240 245 

ACC GCC TTC CCG CTC GTT GCC GCC CTT CTG GTG GAA GGT TCC GAC GTC 822
Thr Ala Phe Pro Leu Val Ala Ala Leu Leu Val Glu Gly Ser Asp Val 
250 255 260 






ACC ATC CCC AAC GTG CTG ATG AAC CCC ACC CCT ACC GCC CTC ATC CTC 870 
Thr Ile Arg Asn Val Leu Met Asn Pro Thr Arg Thr Gly Leu Ile Leu
265 270 275 

ACC TTG CAG GAA ATG GGC GCC GAT ATC GAA GTG CTC AAT GCC CGT CTT 918 
Thr Leu Gln Glu Met Gly Ala Asp Ile Glu Val Leu Asn Ala Arg Leu
280 285 290 295 

GCA GGC GGC GAA GAC GTC GCC GAT CTG CGC GTC AGG GCT TCG AAG CTC 966 
Ala Gly Gly Glu Asp Val Ala Asp Leu Arg Val Arg Ala Ser Lys Leu 
300 305 310 

AAG GGC GTC GTC GTT CCG CCG GAA CGT GCG CCG TCG ATG ATC GAC GAA 1014 
Lys Gly Val Val Val Pro Pro Glu Arg Ala Pro Ser Met Ile Asp Glu
315 320 325 
TAT CCG CTC CTG CCC ATT CCC GCC TCC TTC GCG CAA CCC CAA ACC CTC 1062
Tyr Pro Val Leu Ala Ile Ala Ala Ser Phe Ala Glu Gly Glu Thr Val
330 335 340 

ATC GAC GGG CTC GAC GAA CTG CGC GTC AAG GAA TCG GAT CGT CTG GCA 1110 
Met Asp Gly Leu Asp Glu Leu Arg Val Lys Glu Ser Asp Arg Leu Ala
345 350 355 

GCG GTC GCA CCC CGC CTT GAA GCC AAC GGC CTC CAT TCC ACC GAA CCC 1158 
Ala Val Ala Arg Gly Leu Glu Ala Asn Gly Val Asp Cys Thr Glu Gly

360 365 370 375 

GAG ATC TCG CTG ACG CTT CGC GGC CGC CCC GAC GGC AAG GGA CTG GGC 1206 
Glu Met Ser Leu Thr Val Arg Gly Arg Pro Asp Gly Lys Gly Leu Gly
380 385 390 

GCC CGC ACG CTT CCA ACC CAT CTC GAT CAT CGT ATC CCG ATG ACC TTC 1254 
Gly Gly Thr Val Ala Thr His Leu Asp His Arg Ile Ala Met Ser Phe
395 400 405 

CTC GTC ATC GGC CTT GCG GCC CAA AAG CCC CTG ACG CTT GAC GAC AGT 1302 
Leu Val Met Gly Leu Ala Ala Glu Lys Pro Val Thr Val Asp Asp Ser
410 415 420 

AAC ATG ATC GCC ACG TCC TTC CCC GAA TTC ATG GAC ATG ATG CCG GGA 1350 
Asn Met Ile Ala Thr Ser Phe Pro Glu Phe Met Asp Met Met Pro Gly
425 430 435 

TTC GGC GCA AAG ATC GAC TTC AGC ATA CTC TACTCACTCG ACAGCGAAAA 1400 
Leu Gly Ala Lys Ile Glu Leu Ser Ile Leu
440 445 

TATTATTTCC GAGATTCCGC ATTATTACCG GTTCGTCTCA GCCCGGCTTT AATGTCCAAT 1460

CTTCCATACC TAACAGCATC ACGAAATATC AAAAAAGCTT 1500 






(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 449 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Met Ser His Ser Ala Ser Pro Lys Pro Ala Thr Ala Arg Arg Ser Glu
1 5 10 15

Ala Leu Thr Gly Glu Ile Arg Ile Pro Gly Asp Lys Ser Ile Ser His
20 25 30

Arg Ser Phe Met Phe Gly Gly Leu Ala Ser Gly Glu Thr Arg Ile Thr
35 40 45 
Gly Leu Leu Glu Gly Glu Asp Val Ile Asn Thr Gly Arg Ala Met Gln
50 55 60

Ala Met Gly Ala Lys Ile Arg Lys Glu Gly Asp Val Trp Ile Ile Asn
65 70 75 80

Gly Val Gly Asn Gly Cys Leu Leu Gln Pro Glu Ala Ala Leu Asp Phe
85 90 95

Gly Asn Ala Gly Thr Gly Ala Arg Leu Thr Met Gly Leu Val Gly Thr
100 105 110

Tyr Asp Met Lys Thr Ser Phe Ile Gly Asp Ala Ser Leu Ser Lys Arg
115 120 125

Pro Met Gly Arg Val Leu Asn Pro Leu Arg Glu Met Gly Val Gln Val
130 135 140

Glu Ala Ala Asp Gly Asp Arg Met Pro Leu Thr Leu Ile Gly Pro Lys
145 150 155 160

Thr Ala Asn Pro Ile Thr Tyr Arg Val Pro Met Ala Ser Ala Gln Val
165 170 175

Lys Ser Ala Val Leu Leu Ala Gly Leu Asn Thr Pro Gly Val Thr Thr
180 185 190

Val Ile Glu Pro Val Met Thr Arg Asp His Thr Glu Lys Met Leu Gln
195 200 205

Gly Phe Gly Ala Asp Leu Thr Val Glu Thr Asp Lys Asp Gly Val Arg
210 215 220

His Ile Arg Ile Thr Gly Gln Gly Lys Leu Val Gly Gln Thr Ile Asp
225 230 235 240

Val Pro Gly Asp Pro Ser Ser Thr Ala Phe Pro Leu Val Ala Ala Leu
245 250 255

Leu Val Glu Gly Ser Asp Val Thr Ile Arg Asn Val Leu Met Asn Pro
260 265 270

Thr Arg Thr Gly Leu Ile Leu Thr Leu Gln Glu Met Gly Ala Asp Ile
275 280 285

Glu Val Leu Asn Ala Arg Leu Ala Gly Gly Glu Asp Val Ala Asp Leu
290 295 300

Arg Val Arg Ala Ser Lys Leu Lys Gly Val Val Val Pro Pro Glu Arg
305 310 315 320

Ala Pro Ser Met Ile Asp Glu Tyr Pro Val Leu Ala Ile Ala Ala Ser

325 330 335 



Phe Ala Glu Gly Glu Thr Val Met Asp Gly Leu Asp Glu Leu Arg Val 
340 345 350 






EP 0 546 090 B1 



Lys Glu Ser Asp Arg Leu Ala Ala Val Ala Arg Gly Leu Glu Ala Asn

355 360 365

Gly Val Asp Cys Thr Glu Gly Glu Met Ser Leu Thr Val Arg Gly Arg
370 375 380

Pro Asp Gly Lys Gly Leu Gly Gly Gly Thr Val Ala Thr His Leu Asp
385 390 395 400

His Arg Ile Ala Met Ser Phe Leu Val Met Gly Leu Ala Ala Glu Lys
405 410 415

Pro Val Thr Val Asp Asp Ser Asn Met Ile Ala Thr Ser Phe Pro Glu
420 425 430

Phe Met Asp Met Met Pro Gly Leu Gly Ala Lys Ile Glu Leu Ser Ile
435 440 445 

Leu 



(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Ser Leu Thr Leu Gln Pro Ile Ala Arg Val Asp Gly Thr Ile Asn Leu
1 5 10 15

Pro Gly Ser Lys Thr Val Ser Asn Arg Ala Leu Leu Leu Ala Ala Leu
20 25 30

Ala His Gly Lys Thr Val Leu Thr Asn Leu Leu Asp Ser Asp Asp Val
35 40 45

Arg His Met Leu Asn Ala Leu Thr Ala Leu Gly Val Ser Tyr Thr Leu
50 55 60

Ser Ala Asp Arg Thr Arg Cys Glu Ile Ile Gly Asn Gly Gly Pro Leu
65 70 75 80

His Ala Glu Gly Ala Leu Glu Leu Phe Leu Gly Asn Ala Gly Thr Ala
85 90 95

Met Arg Pro Leu Ala Ala Ala Leu Cys Leu Gly Ser Asn Asp Ile Val

100 105 110 
Leu Thr Gly Glu Pro Arg Met Lys Glu Arg Pro Ile Gly His Leu Val
115 120 125

Asp Ala Leu Arg Leu Gly Gly Ala Lys Ile Thr Tyr Leu Glu Gln Glu
130 135 140

Asn Tyr Pro Pro Leu Arg Leu Gln Gly Gly Phe Thr Gly Gly Asn Val
145 150 155 160 

Asp Val Asp Gly Ser Val Ser Ser Gin Phe Leu Thr Ala Leu Leu Met 
165 170 175 






Thr Ala Pro Leu Ala Pro Glu Asp Thr Val Ile Arg Ile Lys Gly Asp
180 185 190

Leu Val Ser Lys Pro Tyr Ile Asp Ile Thr Leu Asn Leu Met Lys Thr
195 200 205

Phe Gly Val Glu Ile Glu Asn Gln His Tyr Gln Gln Phe Val Val Lys
210 215 220

Gly Gly Gln Ser Tyr Gln Ser Pro Gly Thr Tyr Leu Val Glu Gly Asp
225 230 235 240

Ala Ser Ser Ala Ser Tyr Phe Leu Ala Ala Ala Ala Ile Lys Gly Gly
245 250 255

Thr Val Lys Val Thr Gly Ile Gly Arg Asn Ser Met Gln Gly Asp Ile
260 265 270

Arg Phe Ala Asp Val Leu Glu Lys Met Gly Ala Thr Ile Cys Trp Gly
275 280 285 






Asp Asp Tyr Ile Ser Cys Thr Arg Gly Glu Leu Asn Ala Ile Asp Met
290 295 300

Asp Met Asn His Ile Pro Asp Ala Ala Met Thr Ile Ala Thr Ala Ala
305 310 315 320

Leu Phe Ala Lys Gly Thr Thr Arg Leu Arg Asn Ile Tyr Asn Trp Arg
325 330 335 



Val Lys Glu Thr Asp Arg Leu Phe Ala Met Ala Thr Glu Leu Arg Lys
340 345 350

Val Gly Ala Glu Val Glu Glu Gly His Asp Tyr Ile Arg Ile Thr Pro
355 360 365 






Pro Glu Lys Leu Asn Phe Ala Glu Ile Ala Thr Tyr Asn Asp His Arg
370 375 380

Met Ala Met Cys Phe Ser Leu Val Ala Leu Ser Asp Thr Pro Val Thr
385 390 395 400

Ile Leu Asp Pro Lys Cys Thr Ala Lys Thr Phe Pro Asp Tyr Phe Glu
405 410 415
Gln Leu Ala Arg Ile Ser Gln
420 

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1377 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

CCATGGCTCA CGGTGCAAGC AGCCGTCCAG CAACTGCTCG TAAGTCCTCT GGTCTTTCTG 60 

GAACCGTCCG TATTCCAGGT CACAAGTCTA TCTCCCACAG GTCCTTCATG TTTGGAGGTC 120

TCCCTAGCGG TGAAACTCGT ATCACCGGTC TTTTCGAAGG TGAAGATCTT ATCAACACTG 180 

GTAAGGCTAT GCAAGCTATC CGTGCCAGAA TCCGTAAGGA AGGTGATACT TCGATCATTG 240 

ATGGTGTTGG TAACGGTGGA CTCCTTGCTC CTGAGGCTCC TCTCGATTTC GGTAACGCTG 300

CAACTGGTTG CCGTTTGACT ATGGGTCTTG TTGCTGTTTA CGATTTCCAT AGCACTTTCA 360

TTGGTGACGC TTCTCTCACT AACCGTCCAA TGGGTCGTGT GTTGAACCCA CTTCGCGAAA 420 

TGGGTCTGCA GGTGAAGTCT GAAGACGGTG ATCGTCTTCC AGTTACCTTG CGTGGACCAA 480 

AGACTCCAAC GCCAATCACC TACAGGCTAC CTATGGCTTC CGCTCAAGTG AAGTCCGCTG 540

TTCTGCTTGC TGGTCTCAAC ACCCCAGGTA TCACCACTGT TATCGAGCCA ATCATGACTC 600 

CTGACCACAC TGAAAAGATG CTTCAAGGTT TTGGTGCTAA CCTTACCGTT GAGACTGATG 660 


CTGACGGTGT GCGTACCATC CGTCTTGAAG GTCGTGGTAA GCTCACCGGT CAAGTGATTG 720 

ATGTTCCACG TGATCCATCC TCTACTGCTT TCCCATTGCT TCCTGCCTTG CTTGTTCCAG 780 

CTTCCCACCT CACCATCCTT AACGTTTTCA TGAACCCAAC CCGTACTGGT CTCATCTTGA 840

CTCTGCAGGA AATGGGTGCC GACATCGAAG TGATCAACCC ACGTCTTGCT GGTGGAGAAG 900 

ACGTGGCTGA CTTGCGTGTT CGTTCTTCTA CTTTCAACGC TGTTACTGTT CCAGAAGACC 960

GTGCTCCTTC TATGATCGAC GAGTATCCAA TTCTCGCTGT TGCAGCTGCA TTCGCTGAAG 1020

GTGCTACCGT TATGAACGGT TTGGAAGAAC TCCGTGTTAA GGAAAGCGAC GGTCTTTCTG 1080

CTGTCGCAAA CGGTCTCAAG CTCAACGGTG TTGATTGCGA TGAAGCTGAG ACTTCTCTCC 1140

TCGTGCCTCC TCGTCCTGAC CGTAAGGCTC TCGGTAACGC TTCTCCAGCA GCTGTCGCTA 1200 
CCCACCTCGA TCACCGTATC GCTATGAGCT TCCTCGTTAT GGGTCTCCTT TCTGAAAACC 1260
CTGTTACTGT TGATGATGCT ACTATGATCG CTACTACCTT CCCAGACTTC ATGGATTTGA 1320

TGGCTGGTCT TGGAGCTAAC ATCGAACTCT CCGACACTAA GCCTCCTTGA TCAGCTC 1377 

(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 318 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 87..317

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
AGATCTATCG ATAAGCTTGA TGTAATTGGA GGAAGATCAA AATTTTCAAT CCCCATTCTT 60 

CGATTGCTTC AATTGAAGTT TCTCCG ATG GCG CAA GTT AGC AGA ATC TGC AAT 113

Met Ala Gln Val Ser Arg Ile Cys Asn
1 5 

GGT CTG CAG AAC CCA TCT CTT ATC TCC AAT CTC TCC AAA TCC ACT CAA 161 
Gly Val Gln Asn Pro Ser Leu Ile Ser Asn Leu Ser Lys Ser Ser Gln
10 15 20 25

CGC AAA TCT CCC TTA TCG GTT TCT CTG AAG ACG CAG CAG CAT CCA CGA 209 
Arg Lys Ser Pro Leu Ser Val Ser Leu Lys Thr Gln Gln His Pro Arg
30 35 40 

GCT TAT CCG ATT TCG TCG TCG TGC GGA TTG AAG AAG ACT GGG ATG ACG 257
Ala Tyr Pro Ile Ser Ser Ser Trp Gly Leu Lys Lys Ser Gly Met Thr
45 50 55 

TTA ATT GGC TCT GAG CTT CCT CCT CTT AAG CTC ATG TCT TCT CTT TCC 305 
Leu Ile Gly Ser Glu Leu Arg Pro Leu Lys Val Met Ser Ser Val Ser
60 65 70 

ACG GCG TGC ATG C 318 
Thr Ala Cys Met
75 



(2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 77 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

Met Ala Gln Val Ser Arg Ile Cys Asn Gly Val Gln Asn Pro Ser Leu
1 5 10 15

Ile Ser Asn Leu Ser Lys Ser Ser Gln Arg Lys Ser Pro Leu Ser Val
20 25 30

Ser Leu Lys Thr Gln Gln His Pro Arg Ala Tyr Pro Ile Ser Ser Ser
35 40 45

Trp Gly Leu Lys Lys Ser Gly Met Thr Leu Ile Gly Ser Glu Leu Arg
50 55 60

Pro Leu Lys Val Met Ser Ser Val Ser Thr Ala Cys Met
65 70 75

(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 402 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 87..401

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

AGATCTATCG ATAAGCTTGA TGTAATTGGA GGAAGATCAA AATTTTCAAT CCCCATTCTT 60 


CGATTGCTTC AATTCAAGTT TCTCCG ATC GCG CAA GTT ACC AC A ATC TGC AAT 113 

Met Ala Gln Val Ser Arg Ile Cys Asn
1 5 

CCT GTC CAG AAC CCA TCT CTT ATC TCC AAT CTC TCC AAA TCC AGT CAA 161
Gly Val Gln Asn Pro Ser Leu Ile Ser Asn Leu Ser Lys Ser Ser Gln
10 15 20 25 

CGC AAA TCT CCC TTA TCG GTT TCT CTG AAG ACC CAG CAG CAT CCA CGA 209
Arg Lys Ser Pro Leu Ser Val Ser Leu Lys Thr Gln Gln His Pro Arg
30 35 40 

CCT TAT CCG ATT TCC TCG TCG TGG GGA TTG AAG AAG ACT GGG ATG ACG 257 
Ala Tyr Pro Ile Ser Ser Ser Trp Gly Leu Lys Lys Ser Gly Met Thr
45 50 55 
TTA ATT GGC TCT GAG CTT CGT CCT CTT AAG GTC ATG TCT TCT CTT TCC 305

Leu Ile Gly Ser Glu Leu Arg Pro Leu Lys Val Met Ser Ser Val Ser
60 65 70

ACG GCG GAG AAA GCG TCG GAG ATT GTA CTT CAA CCC ATT AGA GAA ATC 353

Thr Ala Glu Lys Ala Ser Glu Ile Val Leu Gln Pro Ile Arg Glu Ile
75 80 85 

TCC GGT CTT ATT AAG TTG CCT CCC TCC AAG TCT CTA TCA AAT AGA ATT 401

Ser Gly Leu Ile Lys Leu Pro Gly Ser Lys Ser Leu Ser Asn Arg Ile
90 95 100 105 






402 



(2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Met Ala Gln Val Ser Arg Ile Cys Asn Gly Val Gln Asn Pro Ser Leu
1 5 10 15

Ile Ser Asn Leu Ser Lys Ser Ser Gln Arg Lys Ser Pro Leu Ser Val
20 25 30

Ser Leu Lys Thr Gln Gln His Pro Arg Ala Tyr Pro Ile Ser Ser Ser
35 40 45

Trp Gly Leu Lys Lys Ser Gly Met Thr Leu Ile Gly Ser Glu Leu Arg
50 55 60

Pro Leu Lys Val Met Ser Ser Val Ser Thr Ala Glu Lys Ala Ser Glu
65 70 75 80

Ile Val Leu Gln Pro Ile Arg Glu Ile Ser Gly Leu Ile Lys Leu Pro
85 90 95

Gly Ser Lys Ser Leu Ser Asn Arg Ile
100 105 



(2) INFORMATION FOR SEQ ID NO:14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 14..232 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

AGATCTTTCA AGA ATG CCA CAA ATT AAC AAC ATG CCT CAA CGG ATA CAA 49

Met Ala Gln Ile Asn Asn Met Ala Gln Gly Ile Gln
1 5 10

ACC CTT AAT CCC AAT TCC AAT TTC CAT AAA CCC CAA GTT CCT AAA TCT 97 
Thr Leu Asn Pro Asn Ser Asn Phe His Lys Pro Gln Val Pro Lys Ser
15 20 25 



TCA ACT TTT CTT GTT TTT GGA TCT AAA AAA CTG AAA AAT TCA CCA AAT 145 
Ser Ser Phe Leu Val Phe Gly Ser Lys Lys Leu Lys Asn Ser Ala Asn
30 35 40 

TCT ATG TTG GTT TTG AAA AAA GAT TCA ATT TTT ATG CAA AAG TTT TGT 193 
Ser Met Leu Val Leu Lys Lys Asp Ser Ile Phe Met Gln Lys Phe Cys
45 50 55 60 

TCC TTT AGG ATT TCA CCA TCA GTG GCT ACA CCC TGC ATG C 233 
Ser Phe Arg Ile Ser Ala Ser Val Ala Thr Ala Cys Met
65 70 

(2) INFORMATION FOR SEQ ID NO:15: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 73 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Met Ala Gln Ile Asn Asn Met Ala Gln Gly Ile Gln Thr Leu Asn Pro
1 5 10 15

Asn Ser Asn Phe His Lys Pro Gln Val Pro Lys Ser Ser Ser Phe Leu
20 25 30

Val Phe Gly Ser Lys Lys Leu Lys Asn Ser Ala Asn Ser Met Leu Val
35 40 45

Leu Lys Lys Asp Ser Ile Phe Met Gln Lys Phe Cys Ser Phe Arg Ile
50 55 60 

Ser Ala Ser Val Ala Thr Ala Cys Met 
65 70 
(2) INFORMATION FOR SEQ ID NO:16:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 352 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 49..351

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:



AGATCTGCTA CAAATAATTT TGTTTAACTT TAAGAAGGAC ATATATCC ATG GCA CAA 57 

Met Ala Gin 
1 

ATT AAC AAC ATG GCT CAA GGG ATA CAA ACC CTT AAT CCC AAT TCC AAT 105
Ile Asn Asn Met Ala Gln Gly Ile Gln Thr Leu Asn Pro Asn Ser Asn
5 10 15 

TTC CAT AAA CCC CAA GTT CCT AAA TCT TCA AGT TTT CTT GTT TTT GGA 153 
Phe His Lys Pro Gln Val Pro Lys Ser Ser Ser Phe Leu Val Phe Gly
20 25 30 35 

TCT AAA AAA CTG AAA AAT TCA CCA AAT TCT ATG TTG GTT TTG AAA AAA 201 
Ser Lys Lys Leu Lys Asn Ser Ala Asn Ser Met Leu Val Leu Lys Lys
40 45 50 

GAT TCA ATT TTT ATG CAA AAG TTT TGT TCC TTT AGG ATT TCA CCA TCA 249 
Asp Ser Ile Phe Met Gln Lys Phe Cys Ser Phe Arg Ile Ser Ala Ser
55 60 65 

CTG CCT ACA GCA CAG AAC CCT TCT GAG ATA CTG TTG CAA CCC ATT AAA 297

Val Ala Thr Ala Gln Lys Pro Ser Glu Ile Val Leu Gln Pro Ile Lys
70 75 80 

GAG ATT TCA CGC ACT GTT AAA TTG CCT GCC TCT AAA TCA TTA TCT AAT 345 
Glu Ile Ser Gly Thr Val Lys Leu Pro Gly Ser Lys Ser Leu Ser Asn
85 90 95 

AGA ATT C 352 
Arg Ile
100
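
The three-letter translations printed beneath the coding sequences can be re-derived from the nucleotide lines with the standard genetic code. The sketch below is illustrative only and is not part of the original listing: the helper name `translate` is hypothetical, and the input is the first twelve codons of the coding region of SEQ ID NO:16, with the eighth codon read as GCT (Ala) on the assumption that the printed letters are a scanning error.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence into one-letter amino acid codes.

    Whitespace is ignored; translation stops at the first stop codon or
    when fewer than three bases remain.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First twelve codons of the SEQ ID NO:16 coding region (assumed reading).
    print(translate("ATG GCA CAA ATT AAC AAC ATG GCT CAA GGG ATA CAA"))
    # MAQINNMAQGIQ
```

The result, MAQINNMAQGIQ, corresponds to Met Ala Gln Ile Asn Asn Met Ala Gln Gly Ile Gln at the start of SEQ ID NO:17.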






(2) INFORMATION FOR SEQ ID NO:17:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 101 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:

Met Ala Gln Ile Asn Asn Met Ala Gln Gly Ile Gln Thr Leu Asn Pro
1 5 10 15

Asn Ser Asn Phe His Lys Pro Gln Val Pro Lys Ser Ser Ser Phe Leu
20 25 30

Val Phe Gly Ser Lys Lys Leu Lys Asn Ser Ala Asn Ser Met Leu Val
35 40 45

Leu Lys Lys Asp Ser Ile Phe Met Gln Lys Phe Cys Ser Phe Arg Ile
50 55 60

Ser Ala Ser Val Ala Thr Ala Gln Lys Pro Ser Glu Ile Val Leu Gln
65 70 75 80

Pro Ile Lys Glu Ile Ser Gly Thr Val Lys Leu Pro Gly Ser Lys Ser
85 90 95

Leu Ser Asn Arg Ile
100 



(2) INFORMATION FOR SEQ ID NO:18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:

Xaa His Gly Ala Ser Ser Arg Pro Ala Thr Ala Arg Lys Ser Ser Gly
1 5 10 15

Leu Xaa Gly Thr Val Arg Ile Pro Gly Asp Lys Met
20 25



(2) INFORMATION FOR SEQ ID NO:19:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:

Ala Pro Ser Met Ile Asp Glu Tyr Pro Ile Leu Ala Val
1 5 10 
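
The peptides in this listing are printed in three-letter code, whereas the sequence domains recited in the claims use one-letter code. A minimal, illustrative sketch of the conversion follows; the function name `to_one_letter` is hypothetical and not part of the original listing, and the example input is the SEQ ID NO:19 peptide with isoleucine written as Ile.

```python
# Three-letter to one-letter amino acid codes; Xaa denotes an unidentified residue.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(three_letter_sequence: str) -> str:
    """Convert a whitespace-separated three-letter sequence to one-letter code.

    Raises KeyError on any token that is not a recognised residue name,
    which also flags misspelled residue names.
    """
    return "".join(THREE_TO_ONE[token] for token in three_letter_sequence.split())


if __name__ == "__main__":
    seq_19 = "Ala Pro Ser Met Ile Asp Glu Tyr Pro Ile Leu Ala Val"
    print(to_one_letter(seq_19))  # APSMIDEYPILAV
```

Because unknown tokens raise an error, the same helper can be used to flag residue names that were damaged in transcription.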

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Ile Thr Gly Leu Leu Glu Gly Glu Asp Val Ile Asn Thr Gly Lys
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO:21: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

ATCATHGAYG ARTAYCC 17 



(2) INFORMATION FOR SEQ ID NO:22: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

GARGAYGTNA THAACAC 17 



(2) INFORMATION FOR SEQ ID NO:23: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:

CARCAYGTNA THAATAC 17 
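
SEQ ID NOS:21 to 23 are degenerate oligonucleotides written with IUPAC ambiguity codes (R = A or G, Y = C or T, H = A, C or T, N = any base). The sketch below, which is illustrative only and not part of the original listing, counts and enumerates the distinct sequences each such primer represents; the function names `degeneracy` and `expand` are hypothetical, and the primer strings are taken as printed above.

```python
from itertools import product
from math import prod

# IUPAC nucleotide ambiguity codes and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Number of distinct sequences represented by a degenerate primer."""
    return prod(len(IUPAC[base]) for base in "".join(primer.split()).upper())


def expand(primer: str) -> list:
    """Enumerate every concrete sequence represented by a degenerate primer."""
    bases = "".join(primer.split()).upper()
    return ["".join(p) for p in product(*(IUPAC[b] for b in bases))]


if __name__ == "__main__":
    for name, primer in [
        ("SEQ ID NO:21", "ATCATHGAYG ARTAYCC"),
        ("SEQ ID NO:22", "GARGAYGTNA THAACAC"),
        ("SEQ ID NO:23", "CARCAYGTNA THAATAC"),
    ]:
        print(name, degeneracy(primer))
```

As printed, SEQ ID NO:21 represents 24 sequences, and SEQ ID NOS:22 and 23 represent 48 each.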

(2) INFORMATION FOR SEQ ID NO:24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:

CCTCGATAGA TCTAGGAAGA CAACCATGCC TCACGCTC 38

(2) INFORMATION FOR SEQ ID NO:25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

GGATAGATTA AGGAAGACGC GCATGCTTCA CCCTCCAAGC AGCC 44

(2) INFORMATION FOR SEQ ID NO:26:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:

GGCTCCCTGA TCACCTCCAC AATCGCCATC GATGG 35

(2) INFORMATION FOR SEQ ID NO:27: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

CGTCCCTCCT CCTGCCTCGC CCCCCTCACC CC 



(2) INFORMATION FOR SEQ ID NO:28:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

CGGCCAACGC CATGCAGGCT ATGGGCGCC 



(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

CGGGCTCCCG CCTCACTATG CGCCTCGTCG G 



(2) INFORMATION FOR SEQ ID NO:30:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:

Xaa His Ser Ala Ser Pro Lys Pro Ala Thr Ala Arg Arg Ser Glu
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO:31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
GCCGTBGCSC CYTTSCG



(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Pro Gly Asp Lys Ser Ile Ser His Arg Ser Phe Met Phe Gly Gly Leu
1 5 10 15

(2) INFORMATION FOR SEQ ID NO:33:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Leu Asp Phe Gly Asn Ala Ala Thr Gly Cys Arg Leu Thr
1 5 10

(2) INFORMATION FOR SEQ ID NO:34: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
CCGCAATCCC GCCACCGGCG CGCGCC 



(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

GGACGGCTGC TTGCACCGTG AAGCATGCTT AAGCTTGGCG TAATCATGG 



(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
GGAAGACGCC CAGAATTCAC GGTCCAACCA GCCCG 
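
The LENGTH entries in the listing count residues only; the spaces that group nucleotides into blocks of ten are not counted. As a minimal illustration, the following Python sketch checks that convention for the nucleotide entries SEQ ID NO:29, 34, 35 and 36 as printed above. The dictionary layout and the function name `length_mismatches` are assumptions introduced here for illustration and are not part of the listing.

```python
# Nucleotide entries as printed in the sequence listing: (stated LENGTH, printed sequence).
LISTING = {
    29: (31, "CGGGCTCCCG CCTCACTATG CGCCTCGTCG G"),
    34: (26, "CCGCAATCCC GCCACCGGCG CGCGCC"),
    35: (49, "GGACGGCTGC TTGCACCGTG AAGCATGCTT AAGCTTGGCG TAATCATGG"),
    36: (35, "GGAAGACGCC CAGAATTCAC GGTCCAACCA GCCCG"),
}


def length_mismatches(listing):
    """Return {seq_id: (stated, counted)} for entries whose lengths disagree."""
    mismatches = {}
    for seq_id, (stated, printed) in listing.items():
        # Grouping spaces are layout only and are not counted as residues.
        counted = len(printed.replace(" ", ""))
        if counted != stated:
            mismatches[seq_id] = (stated, counted)
    return mismatches


if __name__ == "__main__":
    print(length_mismatches(LISTING))
```

An empty result means every stated LENGTH agrees with the printed residues.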



Claims 

1. An isolated DNA sequence encoding a Class II EPSPS enzyme, said enzyme being an EPSPS enzyme having a
Km for phosphoenolpyruvate (PEP) between 1-150 µM and a Ki(glyphosate)/Km(PEP) ratio between 3-500, which
enzyme is capable of reacting with antibodies raised against a Class II EPSPS enzyme selected from the group
consisting of the enzymes of SEQ ID NO:3 and SEQ ID NO:5.

2. An isolated DNA sequence of Claim 1 encoding a Class II EPSPS enzyme having a Km for phosphoenolpyruvate
(PEP) between 1-150 µM and a Ki(glyphosate)/Km(PEP) ratio between 3-500, said enzyme having the sequence
domains:

-D-K-S-I-S-H-R-S-F-M-F-G-G-L-A-S-G-E-T-R-I-T-G-L-L-E-G-E-D-V-I-N-T-G- ;

-R-L-T-M-G-L-V-G- ;

-G-D-A-S-L-X1-K-R-P-M-G-R-V-L-N-P-L-R-E-M-G-V-Q-V-, wherein X1=S,T;

-M-T-R-D-H-T-E-K-M-L-Q-G-F-G-A- ; and

-R-A-P-S-M-I-D-E-Y-P-X2-L-A-X3-A-A-X4-F-,

wherein X2=V,I; X3=V,I; X4=A,S.

3. A DNA sequence of claim 1 wherein said Km for phosphoenolpyruvate is between 2-25 µM.

4. A DNA sequence of claim 1 wherein said Ki/Km ratio is between 6-250.

5. An isolated DNA sequence encoding a protein which exhibits EPSPS activity wherein said protein is capable of
reacting with antibodies raised against a Class II EPSPS enzyme selected from the group consisting of the enzymes
of SEQ ID NO:3 and SEQ ID NO:5.

6. The DNA sequence of Claim 5 wherein said antibodies are raised against a Class II EPSPS enzyme of SEQ ID
NO:3.






7. A DNA sequence encoding a Class II EPSPS enzyme selected from the group consisting of SEQ ID NO:3 and 
SEQ ID NO:5. 

8. A recombinant, double-stranded DNA molecule comprising in sequence:

a) a promoter which functions in plant cells to cause the production of an RNA sequence;

b) a structural DNA sequence that causes the production of an RNA sequence which encodes a Class II
EPSPS enzyme capable of reacting with antibodies raised against a Class II EPSPS enzyme selected from
the group consisting of the enzymes of SEQ ID NO:3 and SEQ ID NO:5; and

c) a 3' non-translated region which functions in plant cells to cause the addition of a stretch of polyadenyl
nucleotides to the 3' end of the RNA sequence

where the promoter is heterologous with respect to the structural DNA sequence and adapted to cause sufficient
expression of the fusion polypeptide to enhance the glyphosate tolerance of a plant cell transformed with said DNA
molecule.

9. The DNA molecule of Claim 8 in which said structural DNA sequence encodes a fusion polypeptide comprising 
an amino-terminal chloroplast transit peptide and a Class II EPSPS enzyme. 

10. The DNA molecule of Claim 9 wherein said structural DNA sequence encoding a Class II EPSPS enzyme is
selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6.

11. The DNA molecule of Claim 10 wherein said sequence is from SEQ ID NO:2. 

12. A DNA molecule of Claim 9 in which the promoter is a plant DNA virus promoter.

13. A DNA molecule of Claim 12 in which the promoter is selected from the group consisting of CaMV35S and FMV35S
promoters.

14. A DNA molecule of Claim 8 in which said structural DNA encodes a Class II EPSPS enzyme selected from the
group consisting of SEQ ID NO:3 and SEQ ID NO:5. 

15. A method of producing genetically transformed plants which are tolerant toward glyphosate herbicide, comprising
the steps of:

a) inserting into the genome of a plant cell a recombinant, double-stranded DNA molecule comprising:

i) a promoter which functions in plant cells to cause the production of an RNA sequence,

ii) a structural DNA sequence that causes the production of an RNA sequence which encodes a fusion
polypeptide comprising an amino terminal chloroplast transit peptide and a Class II EPSPS enzyme capable
of reacting with antibodies raised against a Class II EPSPS enzyme selected from the group consisting
of the enzymes of SEQ ID NO:3 and SEQ ID NO:5,

iii) a 3' non-translated DNA sequence which functions in plant cells to cause the addition of a stretch of
polyadenyl nucleotides to the 3' end of the RNA sequence

where the promoter is heterologous with respect to the structural DNA sequence and adapted to cause sufficient
expression of the fusion polypeptide to enhance the glyphosate tolerance of a plant cell transformed with
said gene;

b) obtaining a transformed plant cell; and 

c) regenerating from the transformed plant cell a genetically transformed plant which has increased tolerance 
to glyphosate herbicide. 

16. The method of Claim 15 wherein said structural DNA sequence encoding a Class II EPSPS enzyme is selected
from the group consisting of SEQ ID NO:2, SEQ ID NO:4, and SEQ ID NO:6. 

17. The DNA molecule of Claim 16 wherein said sequence is that as set forth in SEQ ID NO:2. 

18. A method of Claim 15 in which the promoter is from a plant DNA virus.

19. A method of Claim 18 in which the promoter is selected from the group consisting of CaMV35S and FMV35S
promoters. 

20. A method of claim 15 in which said structural DNA encodes a Class II EPSPS enzyme selected from the group
consisting of SEQ ID NO:3 and SEQ ID NO:5. 

21. A glyphosate tolerant plant cell comprising a DNA molecule of Claims 9, 10, 13 or 14. 

22. A glyphosate tolerant plant cell of Claim 21 in which the promoter is a plant DNA virus promoter.

23. A glyphosate tolerant plant cell of Claim 22 in which the promoter is selected from the group consisting of CaMV35S 
and FMV35S promoters. 

24. A glyphosate tolerant plant cell of Claim 21 selected from the group consisting of corn, wheat, rice, soybean, cotton,
sugarbeet, oilseed rape, canola, flax, sunflower, potato, tobacco, tomato, alfalfa, poplar, pine, apple and grape. 

25. A glyphosate tolerant plant comprising plant cells of Claim 21.

26. A glyphosate tolerant plant of Claim 25 in which the promoter is from a DNA plant virus promoter.

27. A glyphosate tolerant plant of Claim 26 in which the promoter is selected from the group consisting of CaMV35S 
and FMV35S promoters. 

28. A glyphosate tolerant plant of Claim 25 selected from the group consisting of corn, wheat, rice, soybean, cotton,
sugarbeet, oilseed rape, canola, flax, sunflower, potato, tobacco, tomato, alfalfa, poplar, pine, apple and grape. 

29. A method for selectively controlling weeds in a field containing a crop having planted crop seeds or plants
comprising the steps of:

a) planting said crop seeds or plants which are glyphosate tolerant as a result of a recombinant double-stranded
DNA molecule being inserted into said crop seed or plant, said DNA molecule having:

i) a promoter which functions in plant cells to cause the production of an RNA sequence,

ii) a structural DNA sequence that causes the production of an RNA sequence which encodes a polypeptide
which comprises an amino terminal chloroplast transit peptide and a Class II EPSPS enzyme capable
of reacting with antibodies raised against a Class II EPSPS enzyme selected from the group consisting
of the enzymes of SEQ ID NO:3 and SEQ ID NO:5,

iii) a 3' non-translated DNA sequence which functions in plant cells to cause the addition of a stretch of
polyadenyl nucleotides to the 3' end of the RNA sequence

where the promoter is heterologous with respect to the structural DNA sequence and adapted to cause sufficient
expression of the fusion polypeptide to enhance the glyphosate tolerance of a plant cell transformed with
said gene; and

b) applying to said crop and weeds in said field a sufficient amount of glyphosate herbicide to control said
weeds without significantly affecting said crop.

30. The method of Claim 29 wherein said structural DNA sequence encoding a Class II EPSPS enzyme is selected
from the sequences as set forth in SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6. 

31. A method of Claim 30 in which said DNA molecule contains a structural DNA sequence from SEQ ID NO:2. 

32. A method of Claim 31 in which said DNA molecule further comprises a promoter selected from the group consisting
of the CaMV35S and FMV35S promoters.

33. A method of Claim 32 in which the crop plant is selected from the group consisting of corn, wheat, rice, soybean,
cotton, sugarbeet, oilseed rape, canola, flax, sunflower, potato, tobacco, tomato, alfalfa, poplar, pine, apple and
grape.

34. The method of claim 29 wherein said structural DNA sequence encodes a Class II EPSPS enzyme selected from
the group consisting of SEQ ID NO:3 and SEQ ID NO:5.
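
The kinetic limits recited in Claims 1, 3 and 4 and the sequence domains recited in Claim 2 lend themselves to a simple computational screen. The Python sketch below is illustrative only and forms no part of the claims. The function names `within_kinetic_limits` and `missing_domains` are assumptions introduced here, the range limits are treated as inclusive (also an assumption), and the variable positions X1 to X4 are written as the alternatives listed in Claim 2.

```python
import re

# Sequence domains of Claim 2 in one-letter amino acid code, written as
# regular expressions. Bracketed positions are the variable residues:
# X1 = S or T; X2 = V or I; X3 = V or I; X4 = A or S.
CLAIM_2_DOMAINS = [
    "DKSISHRSFMFGGLASGETRITGLLEGEDVINTG",
    "RLTMGLVG",
    "GDASL[ST]KRPMGRVLNPLREMGVQV",
    "MTRDHTEKMLQGFGA",
    "RAPSMIDEYP[VI]LA[VI]AA[AS]F",
]


def within_kinetic_limits(km_pep_um, ki_glyphosate_um,
                          km_range=(1.0, 150.0), ratio_range=(3.0, 500.0)):
    """Check Km(PEP) and Ki(glyphosate)/Km(PEP) against the given ranges.

    The defaults are the Claim 1 limits; pass km_range=(2, 25) or
    ratio_range=(6, 250) for the narrower limits of Claims 3 and 4.
    Both constants must be given in the same unit (micromolar here).
    Limits are treated as inclusive, which is an assumption of this sketch.
    """
    if km_pep_um <= 0:
        raise ValueError("Km must be positive")
    ratio = ki_glyphosate_um / km_pep_um
    return (km_range[0] <= km_pep_um <= km_range[1]
            and ratio_range[0] <= ratio <= ratio_range[1])


def missing_domains(protein):
    """Return the Claim 2 domain patterns that do not occur in `protein`."""
    seq = protein.upper()
    return [pattern for pattern in CLAIM_2_DOMAINS if not re.search(pattern, seq)]


if __name__ == "__main__":
    # Hypothetical values for illustration, not measured data.
    print(within_kinetic_limits(km_pep_um=10.0, ki_glyphosate_um=500.0))
    print(missing_domains("RLTMGLVG"))
```

A screen of this kind addresses only the kinetic and sequence-domain features; the antibody reactivity recited in Claim 1 has to be established experimentally.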


Patentansprüche

1. An isolated DNA sequence encoding a Class II EPSPS enzyme, said enzyme being an EPSPS enzyme having
a Km for phosphoenolpyruvate (PEP) between 1 and 150 µM and a Ki(glyphosate)/Km(PEP) ratio between 3 and 500,
which enzyme is capable of reacting with antibodies against a Class II EPSPS enzyme selected from the group
consisting of the enzymes of SEQ ID NO:3 and SEQ ID NO:5.

2. An isolated DNA sequence according to claim 1, encoding a Class II EPSPS enzyme having a Km for
phosphoenolpyruvate (PEP) between 1 and 150 µM and a Ki(glyphosate)/Km(PEP) ratio between 3 and 500,
said enzyme having the sequence domains:

-D-K-S-I-S-H-R-S-F-M-F-G-G-L-A-S-G-E-T-R-I-T-G-L-L-E-G-E-D-V-I-N-T-G- ;

-R-L-T-M-G-L-V-G- ;

-G-D-A-S-L-X1-K-R-P-M-G-R-V-L-N-P-L-R-E-M-G-V-Q-V-, wherein X1=S,T;

-M-T-R-D-H-T-E-K-M-L-Q-G-F-G-A- ; and

-R-A-P-S-M-I-D-E-Y-P-X2-L-A-X3-A-A-X4-F-,

wherein X2=V,I; X3=V,I; X4=A,S.

3. A DNA sequence according to claim 1, wherein the Km for phosphoenolpyruvate is between 2 and 25 µM.

4. A DNA sequence according to claim 1, wherein the Ki/Km ratio is between 6 and 250.

5. An isolated DNA sequence encoding a protein which exhibits EPSPS activity, wherein the protein is capable of
reacting with antibodies against a Class II EPSPS enzyme selected from the group consisting of the enzymes of SEQ
ID NO:3 and SEQ ID NO:5.

6. A DNA sequence according to claim 5, wherein the antibodies are against a Class II EPSPS enzyme of SEQ ID NO:3.

7. A DNA sequence encoding a Class II EPSPS enzyme selected from the group consisting of SEQ ID NO:3
and SEQ ID NO:5.

8. A recombinant, double-stranded DNA molecule comprising in sequence:

a) a promoter which functions in plant cells to cause the production of an RNA sequence;

b) a structural DNA sequence which causes the production of an RNA sequence encoding a Class II EPSPS
enzyme capable of reacting with antibodies against a Class II EPSPS enzyme selected from
the group consisting of the enzymes SEQ ID NO:3 and SEQ ID NO:5, and

c) a 3' non-translated region which functions in plant cells to cause the addition of a stretch
of polyadenyl nucleotides to the 3' end of the RNA sequence,

wherein the promoter is heterologous with respect to the structural DNA sequence and is adapted to cause
sufficient expression of the fusion polypeptide to enhance the glyphosate tolerance of a plant cell transformed
with said DNA molecule.

9. A DNA molecule according to claim 8, in which the structural DNA sequence encodes a fusion polypeptide
comprising an amino-terminal chloroplast transit peptide and a Class II EPSPS enzyme.

10. A DNA molecule according to claim 9, wherein the structural DNA sequence encoding a Class II EPSPS enzyme
is selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6.

11. A DNA molecule according to claim 10, wherein said sequence is from SEQ ID NO:2.

12. A DNA molecule according to claim 9, wherein the promoter is a plant DNA virus promoter.

13. A DNA molecule according to claim 12, in which the promoter is selected from the group consisting of
CaMV35S and FMV35S promoters.

14. A DNA molecule according to claim 8, in which said structural DNA encodes an EPSPS enzyme selected from the
group consisting of SEQ ID NO:3 and SEQ ID NO:5.

15. A method of producing genetically transformed plants which are tolerant toward glyphosate herbicide,
comprising the steps of:

a) inserting a recombinant, double-stranded DNA molecule into the genome of a plant cell, the
DNA molecule comprising:

i) a promoter which functions in plant cells to cause the production of an RNA sequence,

ii) a structural DNA sequence which causes the production of an RNA sequence encoding a fusion
polypeptide comprising an amino-terminal chloroplast transit peptide and a Class II EPSPS enzyme
capable of reacting with antibodies against an EPSPS enzyme selected from the group consisting
of the enzymes of SEQ ID NO:3 and SEQ ID NO:5,

iii) a 3' non-translated DNA sequence which functions in plant cells to cause the addition of a stretch
of polyadenyl nucleotides to the 3' end of the RNA sequence,

wherein the promoter is heterologous with respect to the structural DNA sequence and is adapted to cause
sufficient expression of the fusion polypeptide to enhance the glyphosate tolerance of a plant cell
transformed with said gene;

b) obtaining a transformed plant cell; and

c) regenerating from the transformed plant cell a genetically transformed plant, which plant
has increased tolerance to glyphosate herbicide.

16. A method according to claim 15, wherein the DNA sequence encoding a Class II EPSPS enzyme is selected
from the group consisting of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6.

17. A DNA molecule according to claim 16, wherein the sequence is that set forth in SEQ ID NO:2.

18. A method according to claim 15, in which the promoter is from a plant DNA virus.

19. A method according to claim 18, in which the promoter is selected from the group consisting of CaMV35S
and FMV35S promoters.

20. A method according to claim 15, in which the structural DNA encodes a Class II EPSPS enzyme selected
from the group consisting of SEQ ID NO:3 and SEQ ID NO:5.

21. A glyphosate tolerant plant cell comprising a DNA molecule of claims 9, 10, 13 or 14.

22. A glyphosate tolerant plant cell according to claim 21, in which the promoter is a plant DNA virus promoter.

23. A glyphosate tolerant plant cell according to claim 22, in which the promoter is selected from the group
consisting of CaMV35S and FMV35S promoters.

24. A glyphosate tolerant plant cell according to claim 21, selected from the group consisting of corn, wheat,
rice, soybean, cotton, sugarbeet, oilseed rape, canola, flax, sunflower, potato, tobacco, tomato,
alfalfa, poplar, pine, apple and grape.

25. A glyphosate tolerant plant comprising plant cells according to claim 21.

26. A glyphosate tolerant plant according to claim 25, in which the promoter is derived from a DNA plant virus
promoter.

27. A glyphosate tolerant plant according to claim 26, in which the promoter is selected from the group consisting
of CaMV35S and FMV35S promoters.

28. A glyphosate tolerant plant according to claim 25, selected from the group consisting of corn, wheat, rice,
soybean, cotton, sugarbeet, oilseed rape, canola, flax, sunflower, potato, tobacco, tomato, alfalfa,
poplar, pine, apple and grape.

29. A method for selectively controlling weeds in a field containing a crop of planted crop seeds
or plants, comprising the steps of:

a) planting said crop seeds or plants, which are glyphosate tolerant because a recombinant
double-stranded DNA molecule is inserted into the crop seed or plant, the DNA
molecule having:

i) a promoter which functions in plant cells to cause the production of an RNA sequence,

ii) a structural DNA sequence which causes the production of an RNA sequence encoding a polypeptide
comprising an amino-terminal chloroplast transit peptide and a Class II EPSPS enzyme
capable of reacting with antibodies against a Class II EPSPS enzyme selected from the group
consisting of the enzymes of SEQ ID NO:3 and SEQ ID NO:5,

iii) a 3' non-translated DNA sequence which functions in plant cells to cause the addition of a stretch
of polyadenyl nucleotides to the 3' end of the RNA sequence,

wherein the promoter is heterologous with respect to the structural DNA sequence and is adapted to cause
sufficient expression of the fusion polypeptide to enhance the glyphosate tolerance of a plant cell
transformed with said gene; and

b) applying to the crop plants and the weeds in said field a sufficient amount of glyphosate herbicide
to control said weeds without significantly affecting the crop plants.

30. A method according to claim 29, wherein the structural DNA sequence encoding a Class II EPSPS enzyme
is selected from the sequences as set forth in SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6.

31. A method according to claim 30, in which the DNA molecule contains a structural DNA sequence from SEQ ID NO:2.

32. A method according to claim 31, in which the DNA molecule further comprises a promoter selected from the group
consisting of the CaMV35S and FMV35S promoters.

33. A method according to claim 32, in which the crop plant is selected from the group consisting of corn,
wheat, rice, soybean, cotton, sugarbeet, oilseed rape, canola, flax, sunflower, potato, tobacco,
tomato, alfalfa, poplar, pine, apple and grape.

34. A method according to claim 29, wherein the structural DNA sequence encodes a Class II EPSPS enzyme selected
from the group consisting of SEQ ID NO:3 and SEQ ID NO:5.



Revendications

1. An isolated DNA sequence encoding a Class II EPSPS enzyme, said enzyme being an EPSPS enzyme
having a Km for phosphoenolpyruvate (PEP) of between 1 and 150 µM and a Ki(glyphosate)/Km(PEP) ratio
of between 3 and 500, which enzyme is capable of reacting with antibodies raised against a Class II EPSPS
enzyme selected from the enzymes of SEQ ID NO:3 and SEQ ID NO:5.

2. An isolated DNA sequence according to claim 1, encoding a Class II EPSPS enzyme having a Km
for phosphoenolpyruvate (PEP) of between 1 and 150 µM and a Ki(glyphosate)/Km(PEP) ratio of
between 3 and 500, said enzyme having the sequence domains:

-D-K-S-I-S-H-R-S-F-M-F-G-G-L-A-S-G-E-T-R-I-T-G-L-L-E-G-E-D-V-I-N-T-G- ;

-R-L-T-M-G-L-V-G- ;

-G-D-A-S-L-X1-K-R-P-M-G-R-V-L-N-P-L-R-E-M-G-V-Q-V-, wherein X1=S,T;

-M-T-R-D-H-T-E-K-M-L-Q-G-F-G-A- ; and

-R-A-P-S-M-I-D-E-Y-P-X2-L-A-X3-A-A-X4-F-,

wherein X2=V,I; X3=V,I; X4=A,S.

3. A DNA sequence according to claim 1, wherein said Km for phosphoenolpyruvate is between 2
and 25 µM.

4. A DNA sequence according to claim 1, wherein said Ki/Km ratio is between 6 and 250.

5. An isolated DNA sequence encoding a protein which exhibits EPSPS activity, wherein said protein
is capable of reacting with antibodies raised against a Class II EPSPS enzyme selected from the
enzymes of SEQ ID NO:3 and SEQ ID NO:5.

6. A DNA sequence according to claim 5, wherein said antibodies are raised against a Class II EPSPS enzyme
of SEQ ID NO:3.

7. A DNA sequence encoding a Class II EPSPS enzyme selected from SEQ ID NO:3 and SEQ ID NO:5.

8. A recombinant, double-stranded DNA molecule comprising in succession:

a) a promoter which functions in plant cells to cause the production of an RNA sequence;

b) a structural DNA sequence which causes the production of an RNA sequence which encodes a Class II
EPSPS enzyme capable of reacting with antibodies raised against a Class II EPSPS enzyme
selected from the enzymes SEQ ID NO:3 and SEQ ID NO:5;

c) a 3' non-translated region which functions in plant cells to cause the addition of a stretch
of polyadenyl nucleotides to the 3' end of the RNA sequence,

in which the promoter is heterologous with respect to the structural DNA sequence and capable of causing
expression of the fusion polypeptide sufficient to increase the glyphosate tolerance of a plant cell
transformed with said DNA molecule.

9. A DNA molecule according to claim 8, in which said structural DNA sequence encodes a fusion polypeptide
comprising an amino-terminal chloroplast transit peptide and a Class II EPSPS enzyme.

10. A DNA molecule according to claim 9, in which said structural DNA sequence encoding a Class II EPSPS
enzyme is selected from SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6.

11. A DNA molecule according to claim 10, in which said sequence is from SEQ ID NO:2.

12. A DNA molecule according to claim 9, in which the promoter is a plant DNA virus promoter.

13. A DNA molecule according to claim 12, in which the promoter is selected from the CaMV35S
and FMV35S promoters.

14. A DNA molecule according to claim 8, in which said structural DNA encodes a Class II EPSPS enzyme
selected from SEQ ID NO:3 and SEQ ID NO:5.

15. A method of producing genetically transformed plants which are tolerant toward glyphosate herbicide,
comprising the steps of:

a) inserting into the genome of a plant cell a recombinant, double-stranded DNA molecule
comprising:

i) a promoter which functions in plant cells to cause the production of an RNA sequence,

ii) a structural DNA sequence which causes the production of an RNA sequence encoding a fusion
polypeptide which comprises an amino-terminal chloroplast transit peptide and a Class II EPSPS
enzyme capable of reacting with antibodies raised against a Class II EPSPS enzyme
selected from the enzymes SEQ ID NO:3 and SEQ ID NO:5,

iii) a 3' non-translated DNA sequence which functions in plant cells to cause the addition
of a stretch of polyadenyl nucleotides to the 3' end of the RNA sequence

in which the promoter is heterologous with respect to the structural DNA sequence and is capable of
causing expression of the fusion polypeptide sufficient to increase the glyphosate tolerance
of a plant cell transformed with said gene;

b) obtaining a transformed plant cell; and

c) regenerating from the transformed plant cell a genetically transformed plant which exhibits
improved tolerance to glyphosate herbicide.

16. A method according to claim 15, wherein said structural DNA sequence encoding a Class II EPSPS enzyme
is selected from SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6.

17. A DNA molecule according to claim 16, in which said sequence is that set forth in SEQ ID NO:2.

18. A method according to claim 15, wherein the promoter is from a plant DNA virus.

19. A method according to claim 18, wherein the promoter is selected from the CaMV35S and FMV35S promoters.

20. A method according to claim 15, wherein said structural DNA encodes a Class II EPSPS enzyme
selected from SEQ ID NO:3 and SEQ ID NO:5.

21. A glyphosate tolerant plant cell comprising a DNA molecule according to claims 9, 10, 13 or 14.

22. A glyphosate tolerant plant cell according to claim 21, in which the promoter is a plant DNA
virus promoter.

23. A glyphosate tolerant plant cell according to claim 22, in which the promoter is selected from the
CaMV35S and FMV35S promoters.

24. A glyphosate tolerant plant cell according to claim 21, selected from corn, wheat, rice, soybean,
cotton, sugarbeet, oilseed rape, canola, flax, sunflower, potato, tobacco,
tomato, alfalfa, poplar, pine, apple and grape.

25. A glyphosate tolerant plant comprising plant cells according to claim 21.

26. A glyphosate tolerant plant according to claim 25, in which the promoter is a promoter derived
from a DNA plant virus.

27. A glyphosate tolerant plant according to claim 26, in which the promoter is selected from the
CaMV35S and FMV35S promoters.

28. A glyphosate tolerant plant according to claim 25, selected from corn, wheat, rice, soybean, cotton,
sugarbeet, oilseed rape, canola, flax, sunflower, potato, tobacco, tomato,
alfalfa, poplar, pine, apple and grape.

29. A method for selectively controlling weeds in a field containing a crop in which
seeds or plants are planted, comprising the steps of:

a) planting said crop seeds or plants which are glyphosate tolerant as a result of a
recombinant, double-stranded DNA molecule inserted into said crop seed or plant, said DNA
molecule having

i) a promoter which functions in plant cells to cause the production of an RNA sequence,

ii) a structural DNA sequence which causes the production of an RNA sequence encoding a
polypeptide which comprises an amino-terminal chloroplast transit peptide and a Class II EPSPS
enzyme capable of reacting with antibodies raised against a Class II EPSPS enzyme
selected from the enzymes SEQ ID NO:3 and SEQ ID NO:5,

iii) a 3' non-translated DNA sequence which functions in plant cells to cause the addition
of a stretch of polyadenyl nucleotides to the 3' end of the RNA sequence

in which the promoter is heterologous with respect to the structural DNA sequence and is capable of
causing expression of the fusion polypeptide sufficient to improve the glyphosate tolerance of a
plant cell transformed with said gene; and

b) applying to said crop and said seeds in said field an amount of glyphosate herbicide sufficient
to control said weeds without significantly affecting said crop.

30. A method according to claim 29, wherein said structural DNA sequence encoding a Class II EPSPS enzyme
is selected from the sequences set forth in SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6.

31. A method according to claim 30, wherein said DNA molecule contains a structural DNA sequence
from SEQ ID NO:2.

32. A method according to claim 31, wherein said DNA molecule further comprises a promoter selected
from the CaMV35S and FMV35S promoters.

33. A method according to claim 32, wherein the crop plant is selected from corn, wheat, rice,
soybean, cotton, sugarbeet, oilseed rape, canola, flax, sunflower, potato, tobacco,
tomato, alfalfa, poplar, pine, apple and grape.

34. A method according to claim 29, wherein said structural DNA sequence encodes a Class II EPSPS enzyme
selected from SEQ ID NO:3 and SEQ ID NO:5.